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PREFACE 



Despite our attempts to be rigorously selective, the volumes of the 
Course of Theoretical Physics grow larger at every revision. Of course, 
this is a natural and inevitable consequence of the rapid development 
of science, but nevertheless the books become thereby less and less 
convenient for use by students, and indeed by anyone other than 
those whose profession is theoretical physics. 

Faced with this situation, Landau was eagerly considering, in the 
years preceding his terrible accident, the idea of producing a shorter 
course of theoretical physics based on the complete course. He felt 
that the shorter course should comprise the minimum of material 
that should be familiar to every present-day physicist, working in no 
matter what branch of physics. The tragedy of 1962 prevented Landau, 
my teacher and friend, from himself taking part in putting this plan 
into effect, and the first volume of the course did not appear until 
after his death. 

This shorter course will consist of three volumes: 

1. Mechanics and Electrodynamics 

2. Quantum Mechanics 

3. Macroscopic Physics 

The present Volume 1 is essentially a careful abridgement of our 
Mechanics and The Classical Theory of Fields. I have tried to take 
account of all the views expressed by Landau in our early discussions 
of the plans for these books. In particular, he considered that such a 
shorter course should not include any discussion of the general theory 
of relativity. In his opinion, the fundamental physical ideas and results 
of that theory ought to be described in courses of general physics, 

ix 



while the study of its complete mathematical treatment is necessary 
(at least for the present) only for theoretical physicists. 

The remaining content of Volumes 1 and 2 of the Course of Theoret- 
ical Physics has been reduced to about one-half. Since the shorter 
course is not intended to provide a professional knowledge of every 
technique in theoretical physics, only a small number of problems 
have been retained as simple illustrations. 

May I take this opportunity to express my sincere thanks to Dr. 
Sykes and Professor Hamermesh for all that they have done in making 
our books known to English-speaking students of physics. The English 
editions of the books began with Professor Hamermesh's translation 
of The Classical Theory of Fields, and were continued by those of 
Dr. Sykes and his colleagues. His prolonged labours have been largely 
responsible for the carefulness and accuracy of the translations, and 
his comments have on many occasions helped us to make corrections 
for the English versions. 



E. M. Lifshitz 



TRANSLATOR'S NOTE 



The work of Professor Hamermesh and myself as translators of this 
book has consisted largely in selecting the necessary passages from 
our previous translations of Volumes 1 and 2 of the same authors' 
Course of Theoretical Physics. Each of us having done this, I then 
sought to bring the two constituent parts into harmony by eliminating 
the major differences in notation, orthography and style. 

For my own part I wish to acknowledge the valuable advice of 
Dr. J S. Bell and Dr. J. L. Beeby. 

J.B.S. 
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NOTATION 



Mechanical quantities 

Generalised coordinates and momenta q t , p t 
Lagrangian and Hamiltonian functions 

(Part I) L, H 

(Part II) L, 76 
Particle energy and momentum 

(Part I) E, p 

(Part II) £, p 
Angular momentum M 
Moment of force (torque) K 
Inertia tensor I ik 

Electromagnetic quantities 

Electromagnetic field scalar and vector potentials <j>, A 
Electric and magnetic field intensities E, H 
Charge and current densities q, j 
Electric and magnetic dipole moments d, m 

Mathematical notation 

Volume, surface and line elements dV, df, dl 

Three-dimensional vector and tensor indices are denoted by Latin 

letters /', k, I, . . . , which take the values x, y, z. 
Four-dimensional vector and tensor indices are denoted by Greek 

letters A, jx, v, . . ., which take the values 0, 1,2, 3. 
The rule for raising and lowering four-dimensional indices is given 

on page 130. 

The rule of summation over repeated (dummy) indices is given on 
pages 81 and 130. 
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CHAPTER 1 



THE EQUATIONS OF MOTION 

§1. Generalised coordinates 

One of the fundamental concepts of mechanics is that of a particle. 
By this we mean a body whose dimensions may be neglected in de- 
scribing its motion. The possibility of so doing depends, of course, on 
the conditions of the problem concerned. For example, the planets 
may be regarded as particles in considering their motion about the 
Sun, but not in considering their rotation about their axes. 

The position of a particle in space is defined by its position vector r, 
whose components are its Cartesian coordinates x, y, z. The derivative 
v = dr/d/ of r with respect to the time / is called the velocity of the 
particle, and the second derivative d 2 r/d^ is its acceleration. In what 
follows we shall, as is customary, denote differentiation with respect 
to time by placing a dot above a letter: v = f . 

To define the position of a system of N particles in space, it is neces- 
sary to specify N position vectors, i.e. 32V coordinates. The number of 
independent quantities which must be specified in order to define uni- 
quely the position of any system is called the number of degrees of free- 
dom; here, this number is 3N. These quantities need not be the Carte- 
sian coordinates of the particles, and the conditions of the problem 
may render some other choice of coordinates more convenient. Any 
s quantities q v q 2 , . . . , q s which completely define the position of 
a system with s degrees of freedom are called generalised coordinates 
of the system, and the derivatives q\ ,are called its generalised velocities. 

3 



4 



The Equations of Motion 



§2 



When the values of the generalised coordinates are specified, how- 
ever, the "mechanical state" of the system at the instant considered 
is not yet determined in such a way that the position of the system 
at subsequent instants can be predicted. For given values of the coordi- 
nates, the system can have any velocities, and these affect the position 
of the system after an infinitesimal time interval dt. 

If ail the coordinates and velocities are simultaneously specified, it 
is known from experience that the state of the system is completely 
determined and that its subsequent motion can, in principle, be calcu- 
lated. Mathematically, this means that, if all the coordinates q and 
velocities q are given at some instant, the accelerations q at that instant 
are uniquely defined.* 

The relations between the accelerations, velocities and coordinates 
are called the equations of motion. They are second-order differential 
equations for the functions q(f), and their integration makes possible, 
in principle, the determination of these functions and so of the path 
of the system. 



§2. The principle of least action 

The most general formulation of the law governing the motion of 
mechanical systems is the principle of least action or Hamilton's 
principle, according to which every mechanical system is characterised 

by a definite function Uq^ q 2 , q s , q v q 2 <is> 0. or briefly 

L(q, q, t), and the motion of the system is such that a certain condition 
is satisfied. 

Let the system occupy, at the instants h and h, positions defined 
by two sets of values of the coordinates, q w and q (2) . Then the con- 
dition is that the system moves between these positions in such a way 
that the integral 

h 

S= J L(q,q,t)dt (2.1) 

h 



t For brevity, we shall often conventionally denote by q the set of all the coordi- 
nates ?i, q t q„ and similarly by q the set of all the velocities. 
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takes the least possible value. The function L is called the Lagrangian 
of the system concerned, and the integral (2.1) is called the action. 

The fact that the Lagrangian contains only q and q, but not the 
higher derivatives q, q, etc., expresses the result already mentioned, 
that the mechanical state of the system is completely defined when 
the coordinates and velocities are given. 

Let us now derive the differential equations which solve the problem 
of minimising the integral (2.1). For simplicity, we shall at first 
assume that the system has only one degree of freedom, so that only 
one function q(t) has to be determined. 

Let q = q(t) be the function for which S is a minimum. This means 
that S is increased when q(t) is replaced by any function of the form 



where 8q(t) is a function which is small everywhere in the interval of 
time from t± to tz\ dq(t) is called a variation of the function q(t). 
Since, for t = ti and for t = t 2 , all the functions (2.2) must take the 
values q a) and q i2) respectively, it follows that 



When this difference is expanded in powers of dq and 8q in the inte- 
grand, the leading terms are of the first order. The necessary condition 
for S to have a minimum is that these terms (called the first variation, 
or simply the variation, of the integral) should be zero. Thus the prin- 
ciple of least action may be written in the form 



q(t)+dq(t), 



(2.2) 



bq(h) = dq(h) = 0. 
The change in S when q is replaced by q+dq is 



(2.3) 



J L(q+dq, q + 6q,t)dt-f Uq, q, t)dt. 




5S = d j Uq, q, t)dt 



0, 



(2.4) 



or, effecting the variation, 
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§2 



Since bq = ddq/dt, we obtain, on integrating the second term by 
parts, 

«-[£*]>/(4rT,£)**-* <"> 

'I 

The conditions (2.3) show that the integrated term in (2.5) is zero. 
There remains an integral which must vanish for all values of dq. This 
can be so only if the integrand is zero identically. Thus we have 

d/8LX 9 L =0 
dt \ dq J dq 

When the system has more than one degree of freedom, the s 
different functions q£t) must be varied independently in the principle 
of least action. We then evidently obtain s equations of the form 

d /dL\ 3L n , „ . ^ 

d,U)-^° o- 1 ' 2 ' •••'*)• < 2 - 6 > 

These are the required differential equations, called in mechanics 
Lagrange's equations? If the Lagrangian of a given mechanical 
system is known, the equations (2.6) give the relations between acceler- 
ations, velocities and coordinates, i.e. they are the equations of motion 
of the system. 

Mathematically, the equations (2.6) constitute a set of s second- 
order equations for s unknown functions q£t). The general solution 
contains 2s arbitrary constants. To determine these constants and 
thereby to define uniquely the motion of the system, it is necessary 
to know the initial conditions which specify the state of the system 
at some given instant, for example the initial values of all the coordi- 
nates and velocities. 

Let a mechanical system consist of two parts A and B which would, 
if closed, have Lagrangians L A and L B respectively. Then, in the limit 



T In the calculus of variations they are Euler's equations for the formal problem 
of determining the extrema of an integral of the form (2.1). 
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where the distance between the parts becomes so large that the inter- 
action between them may be neglected, the Lagrangian of the whole 
system tends to the value 

UmL = L A +L B . (2.7) 

This additivity of the Lagrangian expresses the fact that the equations 
of motion of either of the two non-interacting parts cannot involve 
quantities pertaining to the other part. 

It is evident that the multiplication of the Lagrangian of a mechan- 
ical system by an arbitrary constant has no effect on the equations 
of motion. From this, it might seem, the following important property 
of arbitrariness can be deduced : the Lagrangians of different isolated 
mechanical systems may be multiplied by different arbitrary constants. 
The additive property, however, removes this indefiniteness, since it 
admits only the simultaneous multiplication of the Lagrangians of all 
the systems by the same constant. This corresponds to the natural 
arbitrariness in the choice of the unit of measurement of the Lagran- 
gian, a matter to which we shall return in §4. 

One further general remark should be made. Let us consider two 
functions L'(q, q, t) and L(q, q, t), differing by the total derivative 
with respect to time of some function f(q, i) of coordinates and time: 

L'(q, q, t) = L(q, q, t)+~ ( f(q, t). (2.8) 
The integrals (2.1) calculated from these two functions are such that 

'j h <t 

S' = J L'(q, q, t) dt = | L(q, q, t) dt+ | ^dt 

h h t t 

= S+f(qV\ h)-f(q«\ h), 

i.e. they differ by a quantity which gives zero on variation, so that the 
conditions dS' = 0 and 8S = 0 are equivalent, and the form of the 
equations of motion is unchanged. Thus the Lagrangian is defined 
only to within an additive total time derivative of any function of 
coordinates and time. 
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§3. Galileo's relativity principle 

In order to describe natural processes, it is necessary to choose a 
frame of reference. This is a coordinate system which serves to indicate 
the position of particles in space, together with a clock fixed to the 
system and serving to indicate time. The laws of nature, including the 
laws of motion, are in general different in form for different frames 
of reference. When an arbitrary frame of reference is chosen, it may 
happen that the laws governing even very simple phenomena become 
very complex. The problem naturally arises of finding a frame of 
reference in which the laws of nature take their simplest form. 

The simplest type of motion is that of a free particle, i.e. one which 
is not subject to any external interactions. There exist frames of refer- 
ence in which free motion takes place with a velocity which is constant 
in magnitude and direction. These are called inertial frames, and the 
statement that they exist is the law of inertia. 

The inertial property may also be formulated by stating that space 
is homogeneous and isotropic, and time homogeneous, relative to 
such a frame of reference. The homogeneity of space and time signifies 
that all positions of a free particle in space at all times are equivalent ; 
the isotropy of space, that different directions in space are equivalent. 
These properties evidently imply that free motion of a particle in any 
direction in space is an unchanging motion. 

If two frames of reference are in uniform relative motion in a 
straight line, and one is an inertial frame, the other frame is obviously 
also an inertial frame, since any free motion in it takes place with 
constant velocity. Thus there is an infinity of inertial frames moving 
with constant relative velocities. 

It is found, in fact, that different inertial frames are equivalent not 
only as regards the properties of free motion. Experiment shows 
that the relativity principle is valid : all the laws of nature are the same 
in every inertial frame. That is to say, the equations which express 
the laws of nature are invariant with respect to transformations of the 
coordinates and time from one inertial frame to another. This means 
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that the equations have the same form when written in terms of the 
coordinates and time in different inertial frames. 

Classical mechanics* or Newtonian mechanics is founded not only 
on the relativity principle but also on the hypothesis that time is 
absolute, i.e. that time runs in the same manner in every inertial frame. 
When combined with this hypothesis, the relativity principle is called 
Galileo's relativity principle. 

The coordinates r and r' of a given point in two different inertial 
frames of reference K and K', of which the latter moves relative to 
the former with velocity V, are related by 

r = r'+Vf. (3.1) 

Here t, the time, is the same in the two frames : 

t = t'. (3.2) 

Differentiation of equation (3.1) with respect to time gives the custom- 
ary law of composition of velocities: 

v = v'+V. (3.3) 

Formulae (3.1) and (3.2) are called a Galileo transformation. Galileo's 
relativity principle states that the laws of nature are invariant under 
this transformation. 

The foregoing discussion indicates quite clearly the exceptional 
properties of inertial frames, and consequently these frames should 
usually be employed in the investigation of mechanical phenomena. 
Unless otherwise stated, it will be implied henceforward that an 
inertial frame is used. 

The complete physical equivalence of all inertial frames shows also 
that there is no "absolute" frame of reference which should be pre- 
ferred to all other frames. 



t As distinct from relativistic mechanics or Einsteinian mechanics, which will be 
discussed in Chapters 8 and 9. 
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§4. The Lagrangian for a free particle 

Let us now go on to determine the form of the Lagrangian, and 
consider first of all the simplest case, that of the free motion of a 
particle relative to an inertial frame of reference. 

Because of the homogeneity of space and time, the Lagrangian of a 
free particle cannot depend explicitly on either the position vector r 
or the time t, i.e. L is a function of the velocity v only ; because of the 
isotropy of space, the Lagrangian must also be independent of the 
direction of the vector v, and is therefore a function only of its magni- 
tude, i.e. of v 2 = v 2 : 

L = L(v 2 ). 

The form of this function is uniquely determined by Galileo's 
relativity principle, which shows that Hp 2 ) must have the same form 
in every inertial frame of reference. When we change from one frame 
to another, the velocity of the particle is transformed in accordance 
with (3.3), and Lip 2 ) thus becomes L[(v' + V) 2 ]. The latter expression 
must therefore differ from L(p' 2 ), if at all, by the total derivative of a 
function of coordinates and time ; it has been shown at the end of §2 
that such a derivative can always be omitted from the Lagrangian. 

This condition is satisfied only by a function of the form 

L = ax 2 . 

The transformation v = v' + V gives 

Hp 2 ) = av 2 = a(v'+V) 2 
= av' 2 +2av'-\+aV 2 , 

or, since v' = dr'/df, 

L(v 2 ) = Uv' 2 )+~ (2ar'-\+aV 2 t). 

The additional terms do in fact form a total derivative and may be 
omitted. 
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The constant a is usually written as \m, and the Lagrangian of a 
freely moving particle thus becomes 

L = \mv 2 . (4.1) 

The quantity m is called the mass of the particle. The additive prop- 
erty of the Lagrangian shows that for a system of particles which 
do not interact we have* 

I = ZV.' ( 4 - 2 ) 

It should be emphasised that the above definition of mass becomes 
meaningful only when the additive property is taken into account. 
As has been mentioned in §2, the Lagrangian can always be multiplied 
by any constant without affecting the equations of motion. As regards 
the function (4.2), such multiplication amounts to a change in the 
unit of mass; the ratios of the masses of different particles remain 
unchanged thereby, and it is only these ratios which are physically 
meaningful. 

It is easy to see that the mass of a particle cannot be negative. For, 
according to the principle of least action, the integral 

2 

S = J \mv 2 At 

i 

has a minimum for the actual motion of the particle in space from 
point 1 to point 2. If the mass were negative, the action integral 
would take arbitrarily large negative values for a motion in which 
the particle rapidly left point 1 and rapidly approached point 2, and 
there would be no minimum. 
It is useful to notice that 

v 2 = (d//d/) 2 = (d/) 2 /(d/) 2 . (4.3) 
Hence, to obtain the Lagrangian, it is sufficient to find the square of 
the element of arc d/ in a given system of coordinates. In Cartesian 
coordinates, for example, d/ 2 = dx^+dj^+dz 2 , and so 

L = \m(x 2 +y 2 +z 2 ). (4.4) 

t We shall use the suffixes a,b,c, ... to distinguish the various particles, and i, 
t, /, . . . to distinguish the coordinates. 
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In cylindrical coordinates d/ 2 = dr 2 +r 2 d0 2 +dz 2 , whence 

L = lw(r 2 + r 2 0 2 + z 2 ). (4.5) 
In spherical coordinates d/ 2 = dr 2 +r 2 d0 2 +r 2 sin 2 0 d0 2 , and 

L = im(r 2 + r 2 d 2 + r 2 4> 2 sin 2 d). (4.6) 

§5. The Lagrangian for a system of particles 

Let us now consider a system of particles which interact with one 
another but with no other bodies. This is called a closed system. It is 
found that the interaction between the particles can be described by 
adding to the Lagrangian (4.2) for non-interacting particles a certain 
function of the coordinates, which depends on the nature of the inter- 
action^ Denoting this function by — U, we have 

L = I> B ^-J7(r 1 ,r„ ...), (5.1) 

where r fl is the position of the ath particle. This is the general form of 
the Lagrangian for a closed system. The sum T = S\m a v 2 is called 
the kinetic energy, and U the potential energy, of the system. The 
significance of these names is explained in §6. 

The fact that the potential energy depends only on the positions of the 
particles at a given instant shows that a change in the position of any 
particle instantaneously affects all the other particles. We may say 
that the interactions are instantaneously propagated. The necessity 
for interactions in classical mechanics to be of this type is closely 
related to the premises upon which the subject is based, namely the 
absolute nature of time and Galileo's relativity principle. If the prop- 
agation of interactions were not instantaneous, but took place with 
a finite velocity, then that velocity would be different in different 
frames of reference in relative motion, since the absoluteness of time 
necessarily implies that the ordinary law of composition of velocities is 
applicable to all phenomena. The laws of motion for interacting 



t This statement is valid only in classical mechanics. 
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bodies would then be different in different inertial frames, a result 
which would contradict the relativity principle. 

In §3 only the homogeneity of time has been spoken of. The form 
of the Lagrangian (5.1) shows that time in mechanics is both homo- 
geneous and isotropic, i.e. its properties are the same in both directions. 
For, if / is replaced by — / {time reversal), the Lagrangian is unchanged, 
and therefore so are the equations of motion. In other words, if a 
given motion is possible in a system, then so is the reverse motion 
(that is, the motion in which the system passes through the same states 
in the reverse order). In this sense all motions which obey the laws 
of classical mechanics are reversible. 

Knowing the Lagrangian, we can derive the equations of motion^: 

±*L = *L (5 2) 

dt dv a dr a ' 

Substitution of (5. 1) gives 

m a dvjdt = -dU/dr a . (5.3) 

In this form the equations of motion are called Newton's equations 
and form the basis of the mechanics of a system of interacting particles. 
The vector 

F a = -BU/dr a (5.4) 

which appears on the right-hand side of equation (5.3) is called the 
force on the oth particle. Like U, it depends only on the coordinates 
of the particles, and not on their velocities. The equation (5.3) therefore 
shows that the acceleration vectors of the particles are likewise func- 
tions of their coordinates only. 

The potential energy is defined only to within an additive constant, 
which has no effect on the equations of motion. This is a particular 
case of the non-uniqueness of the Lagrangian discussed at the end of 
§2. The most natural and most usual way of choosing this constant 



t The derivative of a scalar quantity with respect to a vector is defined as the 
vector whose components are equal to the derivatives of the scalar with respect to 
the corresponding components of the vector. 



14 



The Equations of Motion 



§5 



is such that the potential energy tends to zero as the distances between 
the particles tend to infinity. 

If we use, to describe the motion, arbitrary generalised coordinates 
q t instead of Cartesian coordinates, the following transformation is 
needed to obtain the new Lagrangian: 

Xa = fa(qi, q* ?»)> X a = £ 

Substituting these expressions in the function L — \Y, m a( x a+ya+ 
U, we obtain the required Lagrangian in the form 

L=i^a lk (q)q l g k -U(q), (5.5) 

where the a ilc are functions of the coordinates only. The kinetic energy 
in generalised coordinates is still a quadratic function of the velocities, 
but it may depend on the coordinates also. 

Hitherto we have spoken only of closed systems. Let us now consider 
a system A which is not closed and interacts with another system B 
executing a given motion. In such a case we say that the system A 
moves in a given external field (due to the system B). Since the equa- 
tions of motion are obtained from the principle of least action by 
independently varying each of the coordinates (i.e. by proceeding as 
if the remainder were given quantities), we can find the Lagrangian 
L A of the system A by using the Lagrangian L of the whole system 
A+B and replacing the coordinates q B therein by given functions of 
time. 

Assuming that the system A+B is closed, we have L = T A (q A , q A ) + 
+Tb(Qb> 4b)—U(.<1a-: where the first two terms are the kinetic 
energies of the systems A and B and the third term is their combined 
potential energy. Substituting for q B the given functions of time and 
omitting the term TXqJjt), q B (t)] which depends on time only, and is 
therefore the total time derivative of a function of time, we obtain 
L A = T A (q A , q A )-U[q A , q B (t)]. Thus the motion of a system in an 
external field is described by a Lagrangian of the usual type, the only 
difference being that the potential energy may depend explicitly on 
time. 
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For example, when a single particle moves in an external field, the 
general form of the Lagrangian is 

L = \mv*-U{r,t), (5.6) 

and the equation of motion is 

wv = -3C//3r. (5.7) 

A field such that the same force F acts on a particle at any point 
in the field is said to be uniform. The potential energy in such a field is 
evidently 

l/ = -F.r. (5.8) 

To conclude this section, we may make the following remarks 
concerning the application of Lagrange's equations to various prob- 
lems. It is often necessary to deal with mechanical systems in which 
the interaction between different bodies (or particles) takes the form 
of constraints, i.e. restrictions on their relative position. In practice, 
such constraints are effected by means of rods, strings, hinges and so 
on. This introduces a new factor into the problem, in that the motion 
of the bodies results in friction at their points of contact, and the 
problem in general ceases to be one of pure mechanics (see §20). In 
many cases, however, the friction in the system is so slight that its 
effect on the motion is entirely negligible. If the masses of the constrain- 
ing elements of the system are also negligible, the effect of the con- 
straints is simply to reduce the number of degrees of freedom s of 
the system to a value less than 3iV. To determine the motion of the 
system, the Lagrangian (5.5) can again be used, with a set of independ- 
ent generalised coordinates equal in number to the actual degrees of 
freedom. 

PROBLEMS 

Find the Lagrangian for each of the following systems when placed in a uniform 
gravitational field (acceleration g). 

Problem 1. A coplanar double pendulum (Fig. 1). 

Solution. We take as coordinates the angles </>, and </> 2 which the strings l y and 
/ 2 make with the vertical. Then we have, for the particle m,, 7\ = ^mJi<pi, U = 
= -rriigli cos (f>v In order to find the kinetic energy of the second particle, we 
sxpress its Cartesian coordinates x t , y t (with the origin at the point of support and 
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Fig. 1 



the y axis vertically downwards) in terms of the angles cp 1 and cp 2 : x 2 — 
— /, sin cp, + / 2 sin cp 2 , y t — l t cos<£i+/t cos cp 2 . Then we find 

T 2 - \m t (x\+y\) 

- \m$\4>\ + /l<^l+2/ 1 / 2 cos (<h-<W<k<M. 

Finally 

Z. = ^(m 1 +m i )llcpl+^m s ll<pl+m 2 l 1 l t (p 1 <p\cos{(p 1 -(p 2 )+(m 1 + m 2 )gl 1 cos<£,+ 

+ mtgli cos <£ 2 - 

Problem 2. A simple pendulum whose point of support oscillates vertically 
according to the law a cos yt (Fig. 2). 

Solution. The coordinates of m are 

x = / sin <£, .y = / cos <£+fl cos yt. 

The Lagrangian is 

L = ±ml 2 (p 2 + mlay* cos y/ cos (p + mgl cos <£; 

here terms depending only on time have been omitted, together with the total time 
derivative of mlay 2 sin yt cos cp. 




m 
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CHAPTER 2 



CONSERVATION LAWS 



§6. Energy 

During the motion of a mechanical system, the 2s quantities q t 
and q i (i = 1, 2, . . ., s) which specify the state of the system vary 
with time. There exist, however, functions of these quantities whose 
values remain constant during the motion, and depend only on the 
initial conditions. Such functions are called integrals of the motion. 

The number of independent integrals of the motion for a closed me- 
chanical system with s degrees of freedom is 2s— 1 . This is evident from 
the following simple arguments. The general solution of the equations 
of motion contains 2s arbitrary constants (see the discussion following 
equation (2.6)). Since the equations of motion for a closed system do 
not involve the time explicitly, the choice of the origin of time is entirely 
arbitrary, and one of the arbitrary constants in the solution of the equa- 
tions can always be taken as an additive constant t 0 in the time. Eliminat- 
ing t+to from the 2s functions q,= qj(t+t 0 , C lt C 2 , C 2 ,_ 1 ), 
q t = q f {t+t 0 , C x , C 2 , . . ., C 2l _ 1 ), we can express the 2s— 1 arbitrary 
constants C v C 2 , . . ., C 2s _ 1 as functions of q and q, and these func- 
tions will be integrals of the motion. 

Not all integrals of the motion, however, are of equal importance in 
mechanics. There are some whose constancy is of profound signific- 
ance, deriving from the fundamental homogeneity and isotropy of 
space and time. The quantities represented by such integrals of the 
motion are said to be conserved, and have an important common 

17 
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property of being additive: their values for a system composed of 
several parts which do not interact are equal to the sums of their 
values for the individual parts. 

It is to this additivity that the quantities concerned owe their especial 
importance in mechanics. Let us suppose, for example, that two bodies 
interact only during a certain interval of time. Since each of the addi- 
tive integrals of the whole system is, both before and after the interac- 
tion, equal to the sum of its values for the two bodies separately, the con- 
servation laws for these quantities immediately make possible various 
conclusions regarding the state of the bodies after the interaction, if 
their states before the interaction are known. 

Let us consider first the conservation law resulting from the homo- 
geneity of time. By virtue of this homogeneity, the Lagrangian of a 
closed system does not depend explicitly on time. The total time 
derivative of the Lagrangian can therefore be written 

dL 9L . 9L .. 

If L depended explicitly on time, a term 8L/8f would have to be added 
on the right-hand side. Replacing 8L/8<7,-, in accordance with La- 
grange's equations, by (d/d?) dLjdq h we obtain 



or 




Hence we see that the quantity 

remains constant during the motion of a closed system, i.e. it is an 
integral of the motion; it is called the energy of the system. The 
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additivity of the energy follows immediately from that of the Lagran- 
gian, since (6.1) shows that it is a linear function of the latter. 

The law of conservation of energy is valid not only for closed sys- 
tems, but also for those in a constant external field (i.e. one independent 
of time): the only property of the Lagrangian used in the above 
derivation, namely that it does not involve the time explicitly, is still 
valid. Mechanical systems whose energy is conserved are sometimes 
called conservative systems. 

As we have seen in §5, the Lagrangian of a closed system (or one 
in a constant field) is of the form L = T(q, q)—U(q), where T is a 
quadratic function of the velocities. Using Euler's theorem on homo- 
geneous functions, we have 

Substituting this in (6.1) gives 

E = T(q,q) + U(q); (6.2) 
in Cartesian coordinates, 

£ = E> fl »2 + tf(ri,r a , ...). (6.3) 

a 

Thus the energy of the system can be written as the sum of two quite 
different terms: the kinetic energy, which depends on the velocities, 
and the potential energy, which depends only on the coordinates of 
the particles. 

§7. Momentum 

A second conservation law follows from the homogeneity of space. 
By virtue of this homogeneity, the mechanical properties of a closed 
system are unchanged by any parallel displacement of the entire 
system in space. Let us therefore consider an infinitesimal displacement 
e, and obtain the condition for the Lagrangian to remain unchanged. 

A parallel displacement is a transformation in which every particle 
in the system is moved by the same amount, the position vector r 
becoming r+e. The change in L resulting from an infinitesimal change 
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in the coordinates, the velocities of the particles remaining fixed, is 

where the summation is over the particles in the system. Since e is 
arbitrary, the condition dL — 0 is equivalent to 

£3L/3r a = 0. (7.1) 

a 

From Lagrange's equations (5.2) we therefore have 

d 3L = d 3L =Q 
V dt dv a dt V 3v a 

Thus we conclude that, in a closed mechanical system, the vector 

P = £3L/3v a (7.2) 

a 

remains constant during the motion ; it is called the momentum of the 
system. Differentiating the Lagrangian (5.1), we find that the momen- 
tum is given in terms of the velocities of the particles by 

P = Im a v a . (7.3) 

a 

The additivity of the momentum is evident. Moreover, unlike the 
energy, the momentum of the system is equal to the sum of its values 
p a = m a v a for the individual particles, whether or not the interaction 
between them can be neglected. 

The three components of the momentum vector are all conserved 
only in the absence of an external field. The individual components 
may be conserved even in the presence of a field, however, if the 
potential energy in the field does not depend on all the Cartesian 
coordinates. The mechanical properties of the system are evidently 
unchanged by a displacement along the axis of a coordinate which 
does not appear in the potential energy, and so the corresponding 
component of the momentum is conserved. For example, in a uniform 
field in the z direction, the x and y components of momentum are 
conserved. 
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The equation (7.1) has a simple physical meaning. The derivative 
8L/8r a = — 8C//8r a is the force F a acting on the ath particle. Thus 
equation (7. 1) signifies that the sum of the forces on all the particles 
in a closed system is zero : 

ZF a = 0. (7.4) 

a 

In particular, for a system of only two particles, Fi+F 2 = 0: the 
force exerted by the first particle on the second is equal in magnitude, 
and opposite in direction, to that exerted by the second particle on 
the first. This is the equality of action and reaction (Newton's third 
law). 

If the motion is described by generalised coordinates q t , the deriv- 
atives of the Lagrangian with respect to the generalised velocities 

Pi = dL/dq, (7.5) 

are called generalised momenta, and its derivatives with respect to the 
generalised coordinates 

Ft = dL/dq, (7.6) 

are called generalised forces. In this notation, Lagrange's equations are 

Pi = F t . (7.7) 

In Cartesian coordinates the generalised momenta are the components 
of the vectors p a . In general, however, the p, are linear homogeneous 
functions of the generalised velocities q,, and do not reduce to products 
of mass and velocity. 

§8. Centre of mass 

The momentum of a closed mechanical system has different values 
in different (inertial) frames of reference. If a frame K' moves with 
velocity V relative to another frame K, then the velocities \' a and v a 
of the particles relative to the two frames are such that v a = v a +V. 
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The momenta P and P' in the two frames are therefore related by 

a a a 

or 

P = P'+V5>„. (8.1) 

a 

In particular, there is always a frame of reference K' in which the 
total momentum is zero. Putting P' = 0 in (8.1), we find the velocity 
of this frame : 

V = P/5>„ = £ wj a v a /£ m a . (8.2) 

If the total momentum of a mechanical system in a given frame of 
reference is zero, it is said to be at rest relative to that frame. This is a 
natural generalisation of the term as applied to a particle. Similarly, 
the velocity V given by (8.2) is the velocity of the "motion as a whole" 
of a mechanical system whose momentum is not zero. Thus we see 
that the law of conservation of momentum makes possible a natural 
definition of rest and velocity, as applied to a mechanical system as a 
whole. 

Formula (8.2) shows that the relation between the momentum P 
and the velocity V of the system is the same as that between the mo- 
mentum and velocity of a single particle of mass [i = Em a , the sum 
of the masses of the particles in the system. This result can be regarded 
as expressing the additivity of mass. 

The right-hand side of formula (8.2) can be written as the total 
time derivative of the expression 

R = 5>A/£«- (8.3) 

We can say that the velocity of the system as a whole is the rate of 
motion in space of the point whose position is (8.3). This point is 
called the centre of mass of the system. 

The law of conservation of momentum for a closed system can be 
formulated as stating that the centre of mass of the system moves 
uniformly in a straight line. In this form it generalises the law of 
inertia derived in §3 for a single free particle, whose "centre of mass" 
coincides with the particle itself. 
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In considering the mechanical properties of a closed system it is 
natural to use a frame of reference in which the centre of mass is at 
rest. This eliminates a uniform rectilinear motion of the system as a 
whole. 

The energy of a mechanical system which is at rest as a whole is 
usually called its internal energy E r This includes the kinetic energy 
of the relative motion of the particles in the system and the potential 
energy of their interaction. The total energy of a system moving as a 
whole with velocity V can be written 

E=\f*V*+E,. (8.4) 

Although this formula is fairly obvious, we may give a direct proof of 
it. The energies E and E' of a mechanical system in two frames of 
reference K and K' are related by 

a 
a 

= \iiV* + V •£ m a y' a + 1 £ m a v'* + U 

Or a a 

E = E' +VP H-l^F 2 . (8.5) 

This formula gives the law of transformation of energy from one frame 
to another, corresponding to formula (8.1) for momentum. If the 
centre of mass is at rest in K', then P' = 0, E' — E t , and we have (8.4). 

§9. Angular momentum 

Let us now derive the conservation law which follows from the 
isotropy of space. This isotropy means that the mechanical properties 
of a closed system do not vary when it is rotated as a whole in any 
manner in space. Let us therefore consider an infinitesimal rotation 
of the system, and obtain the condition for the Lagrangian to remain 
unchanged. 

We shall use the vector dty of the infinitesimal rotation, whose 
magnitude is the angle of rotation d<j>, and whose direction is that of 
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the axis of rotation (the direction of rotation being that of a right- 
handed screw driven along dty). 

Let us find, first of all, the resulting increment in the position 
vector from an origin on the axis to any particle in the system under- 
going rotation. The linear displacement of the end of the position vector 



8<#> 




Fig. 3 

is related to the angle by |<5>r| = r sin 8 btj> (Fig. 3). The direction of 
br is perpendicular to the plane of r and dty. Hence it is clear that 

8r = dtyXr. (9.1) 

When the system is rotated, not only the position vectors but also the 
velocities of the particles change direction, and all vectors are trans- 
formed in the same manner. The velocity increment relative to a fixed 
system of coordinates is therefore 

(5v = dfyxv. (9.2) 
If these expressions are substituted in the condition that the Lagran- 
gian is unchanged by the rotation: 

*-?(£•*.+£•*.)-• 

and the derivative 8L/8v a replaced by p a , and 8£/8r a by p a , the result 
is 

£(p a .<5<|)Xr a -|-p a -<5<|>Xv 0 ) = 0 
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or, permuting the factors and taking Sty outside the sum, 
<5<|>-X(r a Xp a + v a Xp a ) = <5<|). ~'Zr 0 XPa = 0. 

a Of a 

Since <5<|) is arbitrary, it follows that (d/df) Zr a Xp a = 0, and we con- 
clude that the vector 

M = £r a Xp a , (9.3) 

a 

called the angular momentum or moment of momentum of the system, is 
conserved in the motion of a closed system. Like the linear momentum, 
it is additive, whether or not the particles in the system interact. 

There are no other additive integrals of the motion. Thus every 
closed system has seven such integrals : energy, three components of 
momentum, and three components of angular momentum. 

Since the definition of angular momentum involves the positions 
of the particles, its value depends in general on the choice of origin. 
The positions r a and r a of a given point relative to origins at a distance 
a apart are related by r a = r^+a. Hence 

M = £ r aXp a 

a 

= 5>«Xp« + «x£p. 

a a 

° F M = M' + aXP. (9.4) 

It is seen from this formula that the angular momentum depends on 
the choice of origin except when the system is at rest as a whole (i.e. 
P = 0). This indeterminacy, of course, does not affect the law of con- 
servation of angular momentum, since momentum is also conserved in 
a closed system. 

We may also derive a relation between the angular momenta in two 
inertial frames of reference K and K', of which the latter moves with 
velocity V relative to the former. We shall suppose that the origins 
in the frames K and K' coincide at a given instant. Then the position 
vectors of the particles are the same in the two frames, while their 
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velocities are related by \ a — \' a +V. Hence we have 

a a a 

The first sum on the right-hand side is the angular momentum M' 
in the frame K' ; using in the second sum the position of the centre 
of mass (8.3), we obtain 

M = M'+ i «RxV. (9.5) 

This formula gives the law of transformation of angular momentum 
from one frame to another, corresponding to formula (8. 1) for momen- 
tum and (8.5) for energy. 

If the frame K' is that in which the system considered is at rest as a 
whole, then V is the velocity of its centre of mass, [iV its total momen- 
tum P relative to K, and 

M = M' + RXP. (9.6) 

In other words, the angular momentum M of a mechanical system 
consists of its "intrinsic angular momentum" in a frame in which it 
is at rest, and the angular momentum RXP due to its motion as a 
whole. 

Although the law of conservation of all three components of angular 
momentum (relative to an arbitrary origin) is valid only for a closed 
system, the law of conservation may hold in a more restricted form 
even for a system in an external field. It is evident from the above 
derivation that the component of angular momentum along an axis 
about which the field is symmetrical is always conserved, for the me- 
chanical properties of the system are unaltered by any rotation about 
that axis. Here the angular momentum must, of course, be defined 
relative to an origin lying on the axis. 

The most important such case is that of a centrally symmetric field 
or central field, i.e. one in which the potential energy depends only 
on the distance from some particular point (the centre). It is evident 
that the component of angular momentum along any axis passing 
through the centre is conserved in motion in such a field. In other 
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words, the angular momentum M is conserved provided that it is 
defined with respect to the centre of the field. 

Another example is that of a field homogeneous in the z direction ; 
in such a field, the component M z of the angular momentum is con- 
served, whichever point is taken as the origin. 

The component of angular momentum along any axis (say the z 
axis) can be found by differentiation of the Lagrangian : 

M, = Z^-, (9.7) 

where the coordinate <f> is the angle of rotation about the z axis. This is 
evident from the above proof of the law of conservation of angular 
momentum, but can also be proved directly. In cylindrical coordinates 
r, <f), z we have (substituting x a = r a cos <p a , y a = r a sin </>J 

M z = £ m a (x a y a - y a x a ) 

a 

= lm a rti a . (9-8) 

a 

The Lagrangian is, in terms of these coordinates, 
L = \ Y,m a (fl + rl4>l + z%-U, 

a 

and substitution of this in (9.7) gives (9.8). 



PROBLEM 

Which components of momentum P and angular momentum M are conserved 
in motion in the following fields ? 

(a) the field of an infinite homogeneous plane, (b) that of an infinite homogene- 
ous cylinder, (c) that of an infinite homogeneous prism, (d) that of two points, 
(e) that of an infinite homogeneous half-plane, (f) that of a homogeneous cone. 

Solution, (a) P x , P v , Af 2 (if the plane is the xy plane), (b) Af„ P, (if the axis of the 
cylinder is the z axis), (c) P z (if the edges of the prism are parallel to the z axis), 
(d) M z (if the line joining the points is the z axis), (e) P v (if the edge of the half- 
plane is the y axis), (f) M z (if the axis of the cone is the z axis). 
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§10. Motion in one dimension 

The motion of a system having one degree of freedom is said to 
take place in one dimension. The most general form of the Lagrangian 
of such a system in fixed external conditions is 

L=\a{q)q*-V{q), (10.1) 

where a(q) is some function of the generalised coordinate q. In partic- 
ular, if q is a Cartesian coordinate (x, say) then 

L = fvwx 2 — U(x). (10.2) 

The equations of motion corresponding to these Lagrangians can 
be integrated in a general form. It is not even necessary to write down 
the equation of motion ; we can start from the first integral of this 
equation, which gives the law of conservation of energy. For the La- 
grangian (10.2) (e.g.) we have ~mx 2 +U(x) = E. This is a first-order 
differential equation, and can be integrated immediately. Since 
dxjdt = \/{2[E~ U(x)]lm}, it follows that 

dx 

+ constant. (10.3) 



» = V(t«) 



VIE- U(x)] 



The two arbitrary constants in the solution of the equation of motion 
are here represented by the total energy E and the constant of integra- 
tion. 
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Since the kinetic energy is essentially positive, the total energy 
always exceeds the potential energy, i.e. the motion can take place 
only in those regions of space where U(x) < E. For example, let the 
function U(x) be of the form shown in Fig. 4. If we draw in the figure 
a horizontal line corresponding to a given value of the total energy, 
we immediately find the possible regions of motion. In the example 
of Fig. 4, the motion can occur only in the range AB or in the range 
to the right of C. 

The points at which the potential energy equals the total energy, 

U(x) = E, (10.4) 

give the limits of the motion. They are turning points, since the velocity 
there is zero. If the region of the motion is bounded by two such 
points, then the motion takes place in a finite region of space, and is 
said to be finite. If the region of the motion is limited on only one 
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Fig. 4 
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side, or on neither, then the motion is infinite and the particle goes to 
infinity. 

A finite motion in one dimension is oscillatory, the particle moving 
repeatedly back and forth between two points (in Fig. 4, in the potential 
well AB between the points X\ and x 2 ). The period T of the oscillations, 
i.e. the time during which the particle passes from x x to x 2 and back, 
is twice the time from xi to x 2 (because of the reversibility property, 
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§5) or, by (10.3), 



*,(£) 



dx 



T(E) = V(2m) 



V[E-U(x)] ' 



(10.5) 



where xi and x 2 are roots of equation (10.4) for the given value of E. 
This formula gives the period of the motion as a function of the total 
energy of the particle. 



A complete general solution can be obtained for an extremely 
important problem, that of the motion of a system consisting of two 
interacting particles (the two-body problem). 

As a first step towards the solution of this problem, we shall show 
how it can be considerably simplified by separating the motion of the 
system into the motion of the centre of mass and that of the particles 
relative to the centre of mass. 

The potential energy of the interaction of two particles depends only 
on the distance between them, i.e. on the magnitude of the difference 
in their position vectors. The Lagrangian of such a system is therefore 



Let r = n— r 2 be the relative position vector, and let the origin be at 
the centre of mass, i.e. miri+m 2 r 2 = 0. These two equations give 
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L = \m 1 i\ + \m 2 it--U(\T l -T 2 \). 



(11.1) 



ri = w 2 r/(wi + w 2 ), r 2 = — Wir/(mi+m 2 ). 



(11.2) 



Substitution in (11.1) gives 



L = \mi*-V(r), 



(11.3) 



where 



m = wiw 2 /(ma + w 2 ) 



(11.4) 



is called the reduced mass. The function (11.3) is formally identical 
with the Lagrangian of a particle of mass m moving in an external 
field U(r) which is symmetrical about a fixed origin. 
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Thus the problem of the motion of two interacting particles is 
equivalent to that of the motion of one particle in a given external 
field U(r). From the solution r = r(t) of this problem, the paths 
ri = ri(0 and T2 = T2(t) of the two particles separately, relative to 
their common centre of mass, are obtained by means of formulae (11.2). 



On reducing the two-body problem to one of the motion of a single 
body, we arrive at the problem of determining the motion of a single 
particle in an external field such that its potential energy depends 
only on the distance r from some fixed point. This is called a central 
field. The force acting on the particle is F = -dU(r)/dr — — (dU/dr)r/r; 
its magnitude is likewise a function of r only, and its direction is 
everywhere that of the position vector. 

As has already been shown in §9, the angular momentum of any 
system relative to the centre of such a field is conserved. The angular 
momentum of a single particle is M = rXp. Since M is perpendicular 
to r, the constancy of M shows that, throughout the motion, the posi- 
tion vector of the particle lies in the plane perpendicular to M. 

Thus the path of a particle in a central field lies in one plane. Using 
polar coordinates r, </> in that plane, we can write the Lagrangian as 



see (4.5). This function does not involve the coordinate </> explicitly. 
Any generalised coordinate q t which does not appear explicitly in the 
Lagrangian is said to be cyclic. For such a coordinate we have, by 
Lagrange's equation, (d/df) dL/dq, = dL/dq, = 0, so that the corre- 
sponding generalised momentum p, = dL/dq, is an integral of the 
motion. This leads to a considerable simplification of the problem of 
integrating the equations of motion when there are cyclic coordinates. 

In the present case, the generalised momentum p^ = mr 2 <p is the 
same as the angular momentum M 2 = M (see (9.8)), and we return to 
the known law of conservation of angular momentum: 
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L = lm(r*+r*4*)-U(r); 



(12.1) 



M = mr 2 <j) = constant. 



(12.2) 
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This law has a simple geometrical interpretation in the plane motion 
of a single particle in a central field. The expression ~r-rd(j> is the 
area of the sector bounded by two neighbouring position vectors and 
an element of the path (Fig. 5). Calling this area df we can write 
the angular momentum of the particle as 

M = 2mf (12.3) 

where the derivative / is called the sectorial velocity. Hence the con- 
servation of angular momentum implies the constancy of the sectorial 
velocity : in equal times the position vector of the particle sweeps out 
equal areas {Kepler's second law).* 




Fig. 5 



The complete solution of the problem of the motion of a particle 
in a central field is most simply obtained by starting from the laws of 
conservation of energy and angular momentum, without writing out 
the equations of motion themselves. Expressing <j> in terms of M 
from (12.2) and substituting in the expression for the energy, we 
obtain 

E = lm(r 2 + rV 2 ) + U(r) = \mr* + \M*lmr* +U(r). (12.4) 

Hence 

-1 = 1^™-^.} <■"> 

or, integrating, 

' = f dr/lA{^[£-f/(r)]-~}+constant. (12.6) 



t The law of conservation of angular momentum for a particle moving in a 
central field is sometimes called the area integral. 
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Writing (12.2) as d<f> = M dt/mr 2 , substituting dt from (12.5) and 
integrating, we find 



Formulae (12.6) and (12.7) give the general solution of the problem. 
The latter formula gives the relation between r and <f>, i.e. the equation 
of the path. Formula (12.6) gives the distance r from the centre as an 
implicit function of time. The angle <f>, it should be noted, always 
varies monotonically with time, since (12.2) shows that <f> can never 
change sign. 

The expression (12.4) shows that the radial part of the motion can 
be regarded as taking place in one dimension in a field where the 
"effective potential energy" is 



The quantity M 2 /2mr z is called the centrifugal energy. The values of r 
for which 



determine the limits of the motion as regards distance from the centre. 
When equation (12.9) is satisfied, the radial velocity f is zero. This does 
not mean that the particle comes to rest as in true one-dimensional 
motion, since the angular velocity $ is not zero. The value r = 0 
indicates a turning point of the path, where r(t) begins to decrease 
instead of increasing, or vice versa. 

If the range in which r may vary is limited only by the condition 
r r min , the motion is infinite : the particle comes from, and returns 
to, infinity. 

If the range of r has two limits r min and r max , the motion is finite and 
the path lies entirely within the annulus bounded by the circles r = r^^ 
and r — r min . This does not mean, however, that the path must be a 
closed curve. During the time in which r varies from r max to r min and 
back, the radius vector turns through an angle A</> which, according 




(12.7) 



U eS = U(r) + M z l2mr*. 



(12.8) 



U(r) + M z /2mr z = E 



(12.9) 
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to (12.7), is given by 

r max 

A * = 2 J vthff-w-^w ■ <1210) 

''min 

The condition for the path to be closed is that this angle should 
be a rational fraction of 2jt, i.e. that = 2nni/n 2 , where «i and n 2 
are integers. In that case, after n 2 periods, the position vector of the 
particle will have made «i complete revolutions and will occupy its 
original position, so that the path is closed. 




Fig. 6 



Such cases are exceptional, however, and when the form of U(r) 
is arbitrary the angle A0 is not a rational fraction of 2j«. In general, 
therefore, the path of a particle executing a finite motion is not closed. 
It passes through the minimum and maximum distances an infinity 
of times, and after infinite time it covers the entire annulus between 
the two bounding circles. The path shown in Fig. 6 is an example. 

There are only two types of central field in which all finite motions 
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take place in closed paths. They are those in which the potential 
energy of the particle varies as 1 \r or as r 2 . The former case is discussed 
in §13; the latter is that of the space oscillator (see §19, Problem 3). 

§13. Kepler's problem 

An important class of central fields is formed by those in which 
the potential energy is inversely proportional to r, and the force accord- 
ingly inversely proportional to r 2 . They include the fields of New- 
tonian gravitational attraction and of Coulomb electrostatic inter- 
action; the latter may be either attractive or repulsive. 

Let us first consider an attractive field, where 

U= -x/r (13.1) 
with a a positive constant. The "effective" potential energy 

M 2 



x 

r 2mr 2 



(13.2) 



is of the form shown in Fig. 7. As r 0, U eS tends to + °° , and as 
r —■ oo it tends to zero from negative values ; for r — M 2 /mcc it has a 
minimum value 

Oeff.mia = -l»tf/2Af» (13.3) 




Fig. 7 
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It is seen at once from Fig. 7 that the motion is finite for E < 0 and 
infinite for E 2=0. 

The shape of the path is obtained from the general formula (12.7). 
Substituting there U = — ce/r and effecting the elementary integration, 
we have 

, , (M/r)-(ma/M) , t t 

<b = cos -1 ' — - — ir-^-+ constant. 

Taking the origin of (j) such that the constant is zero, and putting 

p = M 2 /ma, e = + (2£M 2 /m« 2 )], (13.4) 

we can write the equation of the path as 

p/r = 1 + e cos 4>. (13.5) 

This is the equation of a conic section with one focus at the origin; 2p 
is called the latus rectum of the orbit and e the eccentricity. Our choice 
of the origin of 0 is seen from (13.5) to be such that the point where 
0 = 0 is the point nearest to the origin. 

In the equivalent problem of two particles interacting according to 
the law (13.1), the orbit of each particle is a conic section, with one 
focus at the centre of mass of the two particles. 

It is seen from (13.4) that, if £ < 0, then the eccentricity e < 1, i.e. 
the orbit is an ellipse (Fig. 8) and the motion is finite, in accordance 
with what has been said earlier in this section. According to the for- 




Fig. 8 
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mulae of analytical geometry, the major and minor semi-axes of the 
ellipse are 

a = pl(l- e *) = <x/2\E\, b = WVO = M/V(2m\E\). (13.6) 

The least possible value of the energy is (13.3), and then e = 0, i.e. 
the ellipse becomes a circle. It may be noted that the major axis of 
the ellipse depends only on the energy of the particle, and not on its 
angular momentum. The least and greatest distances from the centre 
of the field (the focus of the ellipse) are 

r min = p/(l + e) = a(l -e), r max = p/(l-e) = a(l+e). (13.7) 

These expressions, with a and e given by (13.6) and (13.4), can, of 
course, also be obtained directly as the roots of the equation U cS {r) = 
= E. 

The period T of revolution in an elliptical orbit is conveniently 
found by using the law of conservation of angular momentum in the 
form of the area integral (12.3). Integrating this equation with respect 
to time from zero to T, we have 2am/ = TM, where / is the area of the 
orbit. For an ellipse / = nab, and by using the formulae (13.6) we find 

= 7c<K- K /(m/2\E\z). (13.8) 

The proportionality between the square of the period and the cube 
of the linear dimension of the orbit is called Kepler's third law. It may 
also be noted that the period depends only on the energy of the particle. 

For E 3= 0 the motion is infinite. If E > 0, the eccentricity e > 1, 
i.e. the path is a hyperbola with the origin as internal focus (Fig. 9). 
The least distance from the focus is 

r mia = p/(e+l) = a(e-l), (13.9) 

where a = p/(e 2 -l) = <x./2Eis the "semi-axis" of the hyperbola. 

If E = 0, the eccentricity e = 1, and the particle moves in a parabola 
with least distance r min = \p. This case occurs if the particle starts 
from rest at infinity. 
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Fig. 9 



Let us now consider motion in a repulsive field, where 
U = <x/r (a > 0). 
Here the effective potential energy is 



a M 2 

U e g = — (- -j. — i 
r 2wr 2 



(13.10) 




Fig. 10 
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and decreases monotonically from + °° to zero as r varies from zero to 
infinity. The energy of the particle must be positive, and the motion 
is always infinite. The calculations are exactly similar to those for the 
attractive field. The path is a hyperbola (or, if E = 0, a parabola) : 

p/r = -l + e cos </>, (13.11) 

where p and e are again given by (13.4). The path passes the centre of 
the field in the manner shown in Fig. 10. The least distance is 



r mi0 =pl(e-l) = a(e+l). (13.12) 



CHAPTER 4 



COLLISIONS BETWEEN PARTICLES 



§14. Elastic collisions 

In many cases the laws of conservation of momentum and energy 
alone can be used to obtain important results concerning the prop- 
erties of various mechanical processes. It should be noted that these 
properties are independent of the particular type of interaction 
between the particles involved. 

Let us consider an elastic collision between two particles, i.e. one 
which involves no change in their internal state. When the law of 
conservation of energy is applied to such a collision, the internal 
energy of the particles may be neglected. 

The laboratory system is a frame of reference in which one of the 
particles (w 2 , say) is at rest before the collision and the other (mi) 
moves with velocity v. The collision is most simply described, however, 
in another frame, where the centre of mass of the two particles is at 
rest (the centre-of-mass system) ; the values of quantities in this system 
will be distinguished by the suffix 0. The velocities of the particles 
before the collision are related to the velocity v in the labora- 
tory system by vi 0 = m 2 \l(mi + m 2 ), v 2 o = — wiv/(wi + w 2 ); see 
(11.2). 

Because of the law of conservation of momentum, the momenta of 
the two particles remain equal and opposite after the collision, and 
are also unchanged in magnitude, by the law of conservation of energy. 
Thus, in the centre-of-mass system the collision simply rotates the 
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velocities, which remain opposite in direction and unchanged in 
magnitude. If we denote by n 0 a unit vector in the direction of the 
velocity of the particle mi after the collision, then the velocities of the 
two particles after the collision (distinguished by primes) are 

\' 10 = m 2 va 0 l(m 1 + m 2 ), v 20 = -m^vt^lim^m^). (14.1) 

In order to return to the laboratory system, we must add to these 
expressions the velocity V of the centre of mass. The velocities in the 
laboratory system after the collision are therefore 



No further information about the collision can be obtained from 
the laws of conservation of momentum and energy. The direction of 
the vector n 0 depends on the law of interaction of the particles and on 
their relative position during the collision. 

The formulae (14.2) for the laboratory system may be interpreted 
geometrically. Here it is more convenient to use momenta instead of 
velocities. Multiplying equations (14.2) by mi and m 2 respectively, we 
obtain 



where m = wiw 2 /(wi+W2) is the reduced mass. We draw a circle of 
radius mv and use the construction shown in Fig. 1 1 . If the unit vector 
n 0 is along OC, the vectors AC and CB give the momenta and 
respectively. When pi is given, the radius of the circle and the points 
A and B are fixed, but the point C may be anywhere on the circle. 
The point A lies inside or outside the circle, according as mi < m-i 
or mi > m 2 . The corresponding diagrams are shown in Figs. 11a, b. 
The angles 6i and 6 2 in these diagrams are the angles between the 
directions of motion after the collision and the direction of impact 
(i.e. of pi). The angle at the centre, denoted by %, which gives the 



\[ = m 2 vn 0 l{m l + m 2 ) + m 1 \j{m 1 + m 2 ), 
v 2 = -m 1 vn 0 /(wi + m 2 )+Wiv/(w 1 + w 2 ). 



(14.2) 




(14.3) 
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Fig. 11 



direction of n 0 , is the angle through which the direction of motion of 
mi is turned in the centre-of-mass system. It is evident from the figure 
that 0i and 6 2 can be expressed in terms of % by 

tan Ql = ™ 2siD * , 6 2 = ±(n- X ). (14.4) 
mi+m 2 cos% 

We may give also the formulae for the magnitudes of the velocities of 
the two particles after the collision, likewise expressed in terms of % : 

V(ml + ml+2 mi m 2 co SX ) Imy ^ 

1 mi+m 2 m 1 +m 2 



The sum di + 6 2 is the angle between the directions of motion of the 
particles after the collision. Evidently 6i+6 2 > \% if m\ < m 2 , and 
Qi + 6 2 < \n if mi m 2 . 

When the two particles are moving afterwards in the same or in 
opposite directions (head-on collision), we have %=it, i.e. the point 
C lies on the diameter through A, and is on OA (Fig. lib; and y' 2 
in the same direction) or on OA produced (Fig. 11a; and P2 in 
opposite directions). 
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In this case the velocities after the collision are 

m 1 -m 2 , 2mi 

v, v 2 = • v. (14.6) 



m 1 + m 2 «i+«2 

This value of v 2 has the greatest possible magnitude, and the maximum 
energy which can be acquired in the collision by a particle originally 
at rest is therefore 

c" 1 '2 4/MiW2 _ . 

^2max = 2 >Wmax = ( mi + m2 )2 E * ( 14J ) 

where E x = \nnv\ is the initial energy of the incident particle. 
If mi < w 2 , the velocity of nti after the collision can have any 




Fig. 12 



direction. If mi =- w 2 , however, this particle can be deflected only 
through an angle not exceeding 0 maji from its original direction ; this 
maximum value of Oi corresponds to the position of C for which AC 
is a tangent to the circle (Fig. lib). Evidently 

sin 0 max = OC/OA = nti/nt!. (14.8) 

The collision of two particles of equal mass, of which one is initially 
at rest, is especially simple. In this case both B and A lie on the circle 
(Fig. 12). 
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Then 

9i = tX, 0 a = !(»-*), 04.9) 

v[ = v cos \x> = v sin \%. (14.10) 

After the collision the particles move at right angles to each other. 

§15. Scattering 

As already mentioned in §14, a complete calculation of the result of 
a collision between two particles (i.e. the determination of the angle %) 
requires the solution of the equations of motion for the particular 
law of interaction involved. 

We shall first consider the equivalent problem of the deflection of a 
single particle of mass m moving in a field U(r) whose centre is at rest 




Fig. 13 



(and is at the centre of mass of the two particles in the original prob- 
lem). 

The path of a particle in a central field is symmetrical about a line 
from the centre to the nearest point in the orbit (OA in Fig. 13). 
Hence the two asymptotes to the orbit make equal angles (0o, say) 
with this line. The angle % through which the particle is deflected as it 
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passes the centre is seen from Fig. 13 to be 

X = |w-2M 05.1) 
The angle <j) 0 itself is given, according to (12.7), by 

(M/r 2 )dr 



4>o 



(15.2) 



V{2m[£-t/(r)]-MV 2 } ' 

'"mln 

taken between the nearest approach to the centre and infinity. It should 
be recalled that r min is a zero of the radicand. 

For an infinite motion, such as that considered here, it is convenient 
to use instead of the constants E and M the velocity of the particle 
at infinity and the impact parameter p. The latter is the length of the 
perpendicular from the centre O to the direction of v^, i.e. the distance 
at which the particle would pass the centre if there were no field of 
force (Fig. 13). The energy and the angular momentum are given in 
terms of these quantities by 

E = Imvt, M = mqv^, (15.3) 



and formula (15.2) becomes 



4>o 



VU-&lr*)-pU/mvl)] 



(15.4) 



Together with (15.1), this gives % as a function of o. 

In physical applications we are usually concerned not with the 
deflection of a single particle but with the scattering of a beam of 
identical particles incident with uniform velocity on the scattering 
centre. The different particles in the beam have different impact 
parameters and are therefore scattered through different angles %. 
Let diV be the number of particles scattered per unit time through 
angles between % and %+d%. This number itself is not suitable for 
describing the scattering process, since it is proportional to the density 
of the incident beam. We therefore use the ratio 



da = dN/n, 



(15.5) 
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where n is the number of particles passing in unit time through unit 
area of the beam cross-section (the beam being assumed uniform 
over its cross-section). This ratio has the dimensions of area and is 
called the {effective) scattering cross-section. It is entirely determined by 
the form of the scattering field and is the most important characteristic 
of the scattering process. 

We shall suppose that the relation between % and q is one-to-one ; 
this is so if the angle of scattering is a monotonically decreasing 
function of the impact parameter. In that case, only those particles 
whose impact parameters lie between q(jO and d@(%) are scattered 
at angles between % and %+d%. The number of such particles is equal 
to the product of n and the area between two circles of radii q and 
o+dg, i.e. diV = 2nq dg-n. The effective cross-section is therefore 

da = 2nqdq. (15.6) 

In order to find the dependence of da on the angle of scattering, we 
need only rewrite (15.6) as 

da = 2u Q (x)\dQ(x)/dx\d X . (15.7) 

Here we use the modulus of the derivative dg/d%, since the derivative 
may be (and usually is) negative. Often da is referred to the solid 
angle element do instead of the plane angle element d%. The solid 
angle between cones with vertical angles % and %+d% is do = 2tc 
sin % d%. Hence we have from (15.7) 

dQ 



da- Q W 



dx 



do. (15.8) 



Returning now to the problem of the scattering of a beam of par- 
ticles, not by a fixed centre of force, but by other particles initially at 
rest, we can say that (15.7) gives the effective cross-section as a function 
of the angle of scattering in the centre-of-mass system. To find the 
corresponding expression as a function of the scattering angle 6 in 
the laboratory system, we must express % in (15.7) in terms of 0 by 
means of formulae (14.4). This gives expressions for both the scattering 
cross-section for the incident beam of particles (% in terms of Oi) and 
that for the particles initially at rest (% in terms of 0 2 ). 
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PROBLEMS 

Problem 1. Determine the effective cross-section for scattering of particles from 
a perfectly rigid sphere of radius a (i.e. when the interaction is such that U = oo 
for r < a and U = 0 for r >■ a). 

Solution. Since a particle moves freely outside the sphere and cannot penetrate 
into it, the path consists of two straight lines symmetrical about the radius to the 
point where the particle strikes the sphere (Fig. 14). It is evident from Fig. 14 
that 

q = osin</> 0 = a sin {(7r- = acos\x. 




Fig. 14 



Substituting in (15.7) or (15.8), we have 

dcr = \na^ sin % &% = {a 2 do, (1) 

i.e. the scattering is isotropic in the centre-of-mass system. On integrating da over 
all angles, we find that the total cross-section a = jio 2 , in accordance with the fact 
that the "impact area" which the particle must strike in order to be scattered is 
simply the cross-sectional area of the sphere. 

Problem 2. Express the effective cross-section (Problem 1) as a function of the 
energy e lost by a scattered particle. 

Solution. The energy lost by a particle of mass m x is equal to that gained by the 
sphere of mass m t . From (14.5) and (14.7), e = E' t = [2mfm 2 /(mj + mj 1 ) sin 2 \x 
= enuaSin 2 ^. whence de = sin % &%; substituting in (1), Problem 1, we 
have da = jia 8 de/e m „. The scattered particles are uniformly distributed with 
respect to e in the range from zero to 

Problem 3. Determine the effective cross-section for particles of mass m, to 
strike a sphere of mass m 2 and radius R to which they are attracted in accordance 
with Newton's law. 

Solution. The condition for a particle to reach the sphere is that r mSn < R, 
where r mln is the point on the path which is nearest to the centre of the sphere. The 
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greatest possible value of q is given by r min = R; this is equivalent to U e1l (R) — E 
or jmjuLeLi/^ 2 - 1 "/^ = |«i*>io, where a = ymxirii (y being the gravitational 
constant) and we have put m « m t on the assumption that m 2 >:> »h- Solving for 
elai. we finally obtain a = ji.R 2 (1 +2>>m 2 /.Ri>L). When <» the effective cross- 

section tends, of course, to the geometrical cross-section of the sphere. 



§16. Rutherford's formula 

One of the most important applications of the formulae derived 
above is to the scattering of charged particles in a Coulomb field. 
Putting in (15.4) U = a/r and effecting the elementary integration, we 
obtain 

x/mvtQ 
<po = cos 1 /M , , , — o , 

whence q 2 = (oP/nPvlJ tan 2 (j> 0 , or, putting (j> 0 = y(j«— %) from (15.1), 

e> = (a«/ifft£)cot«i- Z . (16.1) 

Differentiating this expression with respect to % and substituting 
in (15.7) or (15.8) gives 

da = 7c(pc/mvl,f cos \% d%/sin 3 \% (16.2) 

or 

da = (a/2ifi»L) t do/sin* -J- z . (1 6.3) 

This is Rutherford's formula. It may be noted that the effective cross- 
section is independent of the sign of a, so that the result is equally 
valid for repulsive and attractive Coulomb fields. 

Formula (16.3) gives the effective cross-section in the frame of 
reference in which the centre of mass of the colliding particles is at 
rest. The transformation to the laboratory system is effected by means 
of formulae (14.4). For particles initially at rest we substitute 
% = 7t—2d 2 in (16.2) and obtain 

da 2 = 2jr(a//m>L) 2 sin02d0 2 /cos 3 02 

= (oc/mvl)* do 2 /cos 3 0 2 . (1 6.4) 

The same transformation for the incident particles leads, in general, 
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to a very complex formula, and we shall merely note two particular 
cases. 

If the mass m 2 of the scattering particle is large compared with the 
mass nti of the scattered particle, then % k 61 and m « mi, so that 

dffi = (a/4£i) 2 doi/sin*l0i, (16.5) 

where E\ = ^m^ is the energy of the incident particle. 

If the masses of the two particles are equal (mi = m 2 , m = \m{), 
then by (14.9) % = 20i, and substitution in (16.2) gives 

dffi = 2rc(a/£i) 2 cos0id0i/sin 3 0i 

= (oc/Eif cos 0i dd/sin 4 0i. (16.6) 

If the particles are entirely identical, that which was initially at rest 
cannot be distinguished after the collision. The total effective cross- 
section for all particles is obtained by adding dffi and dff 2 , and replac- 
ing 0i and 0 2 by their common value 0 : 

dff = (a/^W-* + — L \ cos 6 do. (16.7) 
\ sin* 0 cos* 0 J 

Let us return to the general formula (16.2) and use it to determine 
the distribution of the scattered particles with respect to the energy 
lost in the collision. When the masses of the scattered (mi) and scatter- 
ing (m 2 ) particles are arbitrary, the velocity acquired by the latter is 
»iven in terms of the angle of scattering in the centre-of-mass system 
3y v' 2 = [2m 1 /(mi+m 2 )]v ex> sin ~%\ see (14.5). The energy acquired by 
« 2 and lost by mi is therefore e = ym 2 w' 2 2 = (2m 2 /m 2 )«;f 0 sin 2 \%. 
Expressing sin \% in terms of e and substituting in (16.2), we obtain 

dff = 2ac(a 2 //W2»L) de/e 2 . (16.8) 

rhis is the required formula: it gives the effective cross-section as a 
unction of the energy loss e, which takes values from zero to e max = 
= 2m 2 v 2 Jm 2 . 
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§17. Free oscillations in one dimension 

A very common form of motion of mechanical systems is what are 
called small oscillations of a system about a position of stable equi- 
librium. We shall consider first of all the simplest case, that of a system 
with only one degree of freedom. 

Stable equilibrium corresponds to a position of the system in which 
its potential energy U(q) is a minimum. A movement away from this 
position results in the setting up of a force — dU/dq which tends to 
return the system to equilibrium. Let the equilibrium value of the 
generalised coordinate q be q 0 . For small deviations from the equi- 
librium position, it is sufficient to retain the first non-vanishing term 
in the expansion of the difference U(q)—U(q 0 ) in powers of q — q 0 . In 
general this is the second-order term: U(q)—U(q 0 ) «s ^k(q — q 0 ) 2 , 
where A: is a positive coefficient, the value of the second derivative 
U"(q) for q = qo- We shall measure the potential energy from its 
minimum value, i.e. put U(q 0 ) = 0, and use the symbol 

x = q-q 0 (17.1) 

for the deviation of the coordinate from its equilibrium value. Thus 

U(x) = ika*. (17.2) 

The kinetic energy of a system with one degree of freedom is in 
general of the form \a(q)q 2 = ^a(q)x 2 . In the same approximation, it 
is sufficient to replace the function a(q) by its value at q — q 0 . Putting 
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for brevity t a(q 0 ) — m, we have the following expression for the 

Lagrangian of a system executing small oscillations in one dimen- 
sion :% 

L = \m&-\k#. (17.3) 
The corresponding equation of motion is 

mx+kx = 0, (17.4) 

or 

x+aPx = 0, (17.5) 

where 

co = y/(k/m). (17.6) 

Two independent solutions of the linear differential equation (17.5) 
are cos cot and sin cot, and its general solution is therefore 

x = cicoscot+c t sincot. (17.7) 

This expression can also be written 

x = acos (cot+a.). (17.8) 

Sincecos(ct)f-f-a) = coscofcosa — sin co/ sin a, a comparison with (17.7) 
shows that the arbitrary constants a and a are related to ci and c 2 by 

a = V(4+4), tan a = -c 2 / Cl . (17.9) 

Thus, near a position of stable equilibrium, a system excutes har- 
monic oscillations. The coefficient a of the periodic factor in (17.8) is 
called the amplitude of the oscillations, and the argument of the 
cosine is their phase; a is the initial value of the phase, and evidently 
depends on the choice of the origin of time. The quantity co is called 
the angular frequency of the oscillations ; in theoretical physics, how- 
ever, it is usually called simply the frequency, and we shall use this 
name henceforward. 

The frequency is a fundamental characteristic of the oscillations, and 
is independent of the initial conditions of the motion. According to 



t It should be noticed that m is the mass only if x is the Cartesian coordinate. 
% Such a system is often called a one-dimensional oscillator. 
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formula (17.6) it is entirely determined by the properties of the mechan- 
ical system itself. It should be emphasised, however, that this prop- 
erty of the frequency depends on the assumption that the oscillations 
are small, and ceases to hold in higher approximations. Mathematic- 
ally, it depends on the fact that the potential energy is a quadratic 
function of the coordinate. 

The energy of a system executing small oscillations is E — ~mx 2 + 
+ ±kx 2 = |w(x 2 +ft) 2 x 2 ) or, substituting (17.8), 

E=\mo?a*. (17.10) 

It is proportional to the square of the amplitude. 

The time dependence of the coordinate of an oscillating system is 
often conveniently represented as the real part of a complex express- 
ion: 

x = re [Aexp(-ia)t)], (17.11) 

where A is a complex constant ; putting 

^ = aexp(-/a), (17.12) 

we return to the expression (17.8). The constant A is called the complex 
amplitude; its modulus is the ordinary amplitude, and its argument 
is the initial phase. 

The use of exponential factors is mathematically simpler than that 
of trigonometrical ones because they are unchanged in form by 
differentiation. So long as all the operations concerned are linear 
(addition, multiplication by constants, differentiation, integration), 
we may omit the sign re throughout and take the real part of the final 
result. 

PROBLEMS 

Problem 1. Express the amplitude and initial phase of the oscillations in terms 
of the initial coordinate x 0 and velocity v 0 . 

Solution, a = V(*o+ v o/ to *)> tan a = - v ol wx o- 

Problem 2. Find the ratio of frequencies to and to' of the oscillations of two 
diatomic molecules consisting of atoms of different isotopes, the masses of the 
atoms being m u m 2 and m[, m' % . 
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Solution. Since the atoms of the isotopes interact in the same way, we have 
k = k'. The coefficients m in the kinetic energies of the molecules are their reduced 
masses. According to (17.6) we therefore have 

w' / m l m i (m' l + m£) 

o> V m' 1 m' i (im 1 + m 2 ) 

Problem 3. Find the frequency of oscillations of a particle of mass m which is 
free to move along a line and is attached to a spring whose other end is fixed at a 
point A (Fig. 15) at a distance / from the line. A force Fis required to extend the 
spring to length /. 




A 



It 



Fig. 15 

Solution. The potential energy of the spring is (to within higher-order terms) 
equal to the force F multiplied by the extension dl of the spring. For x « / we have 
SI = V(l 2 +x 2 )-lcxx*l21,so that U =- F*72/. Since the kinetic energy is \mx 2 , 
we have at = y/(F\ml). 

§18. Forced oscillations 

Let us now consider oscillations of a system on which a variable 
external force acts. These are called forced oscillations, whereas those 
discussed in §17 are free oscillations. Since the oscillations are again 
supposed small, it is implied that the external field is weak, because 
otherwise it could cause the displacement x to take too large values. 

The system now has, besides the potential energy \kx*, the additional 
potential energy U e {x, t) resulting from the external field. Expanding 
this additional term as a series of powers of the small quantity x, we 
have U e {x, t) ^ U e (0, t)+x[dU e /dx] x=0 . The first term is a function 
of time only, and may therefore be omitted from the Lagrangian, as 
being the total time derivative of another function of time. In the 
second term — [dU e jdx] X!=0 is the external "force" acting on the system 
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in the equilibrium position, and is a given function of time, which 
we denote by F(t). Thus the potential energy involves a further term 
— xF(t), and the Lagrangian of the system is 

L = \mx 2 -\kx 2 +xF(t). (18.1) 

The corresponding equation of motion is mx+kx = F(t) or 

x+afix = F{t)lm, (18.2) 

where we have again introduced the frequency co of the free oscilla- 
tions. 

The general solution of this inhomogeneous linear differential 
equation with constant coefficients is x = xo+xi, where xo is the 
general solution of the corresponding homogeneous equation and X\ 
is a particular integral of the inhomogeneous equation. In the present 
case xo represents the free oscillations discussed in §17. 

Let us consider a case of especial interest, where the external force 
is itself a simple periodic function of time, of some frequency y : 

F(t)=fcos(yt+P). (18.3) 

We seek a particular integral of equation (18.2) in the form Xi = 
= b cos (yt + /?), with the same periodic factor. Substitution in that equa- 
tion gives b = fjm(co 2 —y 2 ); adding the solution of the homogeneous 
equation, we obtain the general integral in the form 

x = acos(cot+x)+[f/m(co*-yZ)]cos(yt+P). (18.4) 

The arbitrary constants a and a are found from the initial conditions. 

Thus a system under the action of a periodic force executes a motion 
which is a combination of two oscillations, one with the intrinsic 
frequency co of the system and one with the frequency y of the force. 

The solution (18.4) is not valid when resonance occurs, i.e. when the 
frequency y of the external force is equal to the intrinsic frequency co 
of the system. To find the general solution of the equation of motion 
in this case, we rewrite (18.4) as 

x = a cos (cot+x) + [f/m(co* -y 2 )] [cos (y/+/3) - cos (cot +/S)], 
where a now has a different value. As y — co, the second term is 
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indeterminate, of the form 0/0. Resolving the indeterminacy by 
L'Hospital's rule, we have 

x = a cos (cot+x)+(f/2mco)t s'm(cot+P). (18.5) 

Thus the amplitude of oscillations in resonance increases linearly with 
the time (until the oscillations are no longer small and the whole 
theory given above becomes invalid). 

Let us also ascertain the nature of small oscillations near resonance, 
when y = co + e with e a small quantity. We put the general solution 
in the complex form 

x — A exp ( — icot) + B exp [ — i(co + e)t] 
= [A + B exp (-ist)]exp (-icot). (18.6) 

Since the quantity A+B exp (— ist) varies only slightly over the period 
2n/co of the factor exp(— icot), the motion near resonance may be 
regarded as small oscillations of variable amplitude. Denoting this 
amplitude by C, we have C = \A+B exp (—iet)\. Writing A and B in 
the form a exp (—/a) and b exp (—iff) respectively, we obtain 

C a = fl a + 6 a + 2a6cos(eH-/i-a). (18.7) 

Thus the amplitude varies periodically with frequency e between the 
limits \a— b\ =s C *sa+b. This phenomenon is called beats. 

The equation of motion (18.2) can be integrated in a general form 
for an arbitrary external force F(t). This is easily done by rewriting 
the equation as 

d 1 

~j-(x—icox)+ico(x—icox) — — F(t) 
at m 

or 

d|/df+ icot = F{t)lm, (18.8) 

where 

I = x-icox (18.9) 

is a complex quantity. Equation (18.8) is of the first order. Its solution 
when the right-hand side is replaced by zero is I = A exp (—icot) with 
constant A. As usual, we seek a solution of the inhomogeneous 
equation in the form I = A(t) exp (-icot), obtaining for the function 
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A{t) the equation A{t) = F{t) exp(/co/)/w. Integration gives the solution 
of (18.8): 



:-*>{[: 



I = exp(-ooO] | ^F{t)exp{icot)dt+£ 0 \, (18.10) 



where the constant of integration | 0 is the value of I at the instant 
t = 0. This is the required general solution; the function x{t) is given 
by the imaginary part of (18.10), divided by — co.* 

The energy of a system executing forced oscillations is naturally 
not conserved, since the system gains energy from the source of the 
external field. Let us determine the total energy transmitted to the 
system during all time, assuming its initial energy to be zero. According 
to formula (18.10), with the lower limit of integration — °° instead of 
zero and with £(— ») = 0, we have for t — °° 



l!(~)l 2 = -v I I F(0exp(fo0df| 



f 



The energy of the system is 

E = lw(x 2 +co 2 x 2 ) = lm|£] 2 . (18.1 1) 

Substituting 1 1( °° ) | 2 , we obtain the energy transferred : 



* 2m 



f 



F(t) exp {wot) dt 



(18.12) 



it is determined by the squared modulus of the Fourier component of 
the force F{t) whose frequency is the intrinsic frequency of the system. 

In particular, if the external force acts only during a time short in 
comparison with 1/co, we can put exp(«o() ^ 1. Then 



£ 2m 



( J"™*)*. 



* The force f(r)must, of course, be written in real form. 
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This result is obvious: it expresses the fact that a force of short 
duration gives the system a momentum j F dt without bringing about 
a perceptible displacement. 

PROBLEMS 

Problem 1 . Determine the forced oscillations of a system under a force F(/) of 
the following forms, if at time / = 0 the system is at rest in equilibrium (x = x = 0) : 
(a) F = F„, a constant, (b) F = at, (c) F = F„ exp (-a/), (d) F = F„ exp (-a/)X 
Xcos /?/. 

Solution, (a) * = (F„/ma> 2 )(l— cos to/). The action of the constant force results 
in a displacement of the position of equilibrium about which the oscillations take 
place. 

(b) x = (fljmm*){a>t— sin a>t). 

(c) x = [F 0 /m(&> 2 +a 2 )][exp(-a/)-cos&>/+(a/co)sinco/]. 

(d) x = F„{ -(to 2 + a 2 - jS 2 ) cos to/ + (a/to) (to 2 + a 2 +/S 2 ) sin <o/ + 

+ exp ( - at) [(<u 2 + a 2 - /S 2 ) cos 0/ - 2a/S sin |3/]}/m [(to 2 + a 2 - /S 2 ) 2 + 4a 2 /? 2 ] . 

This last case is conveniently treated by writing the force in the complex form 

F= F o exp[(-a-i0)/]. 

Problem 2. Determine the final amplitude for the oscillations of a system under a 
force which is zero for / < 0, F 0 //rfor 0 / < T, and F„ for / => T(Fig. 16), if up 
to time / = 0 the system is at rest in equilibrium. 




/ 



Fig. 16 



Solution. During the interval 0 < / T the oscillations are determined by the 
initial condition as x = (F 0 lmTa> 3 )(o}t - sin cot). For ( > Twe seek a solution in the 
form 

x = c l cosco(t — T) + C2s'moL)(t—T)+F 0 lmoL) 2 . 

The continuity of x and x at / = T gives c t = — (FJmTa) 3 ) sin ojT, c 2 = 
= (FJmTco 3 ) (1 - cos a>T). The amplitude is a = V(ci 2 + cf) = (2F 0 /m7co 3 ) sin \a>T. 
This is the smaller, the more slowly the force F„ is applied (i.e. the greater T). 
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Problem 3. The same as Problem 2, but for a constant force F 0 which acts for a 
finite time T (Fig. 17). 

Solution. As in Problem 2, or more simply by using formula (18.10). For 
r > Twe have free oscillations about x = 0, and 



F r 

f = — — exp ( — icot) exp (icot) dt 

171 J 



iF 

= — - [1 - exp (for)] exp (-icot). 
com 



T 

Fig. 17 

The squared modulus of I gives the amplitude from the relation 1 1 | 2 = nV, The 
result is 

a = (2F 0 /mco ! ) sin f cor. 



§19. Oscillations of systems with more than one degree of freedom 

The theory of free oscillations of systems with s degrees of freedom 
is analogous to that given in §17 for the case s — 1. 

Let the potential energy of the system U as a function of the general- 
ised coordinates q t (/ = 1, 2, s) have a minimum for q t = g i0 . 
Putting 

Xi = qi-q i0 (19.1) 

for the small displacements from equilibrium and expanding U as a 
function of the jc,- as far as the quadratic terms, we obtain the potential 
energy as a positive definite quadratic form 

tf = i (19.2) 

i,k 



where we again take the minimum value of the potential energy as 
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zero. Since the coefficients k lk and k kl in (19.2) multiply the same quan- 
tity XjX k , it is clear that they may always be considered equal : k ik = k kl . 

In the kinetic energy, which has the general form \^a ik {q)qjq k 
(see (5.5)), we put q, = q i0 in the coefficients a ik and, denoting a ik (q 0 ) 
by m lk , obtain the kinetic energy as a positive definite quadratic form 

| (19.3) 

i.k 

The coefficients m ik also may always be regarded as symmetrical: 
m ik = m ki . Thus the Lagrangian of a system executing small free 
oscillations is 

L = I £ (rn lk x t x k -k ik x,x k ). (19.4) 

i.k 

Let us now derive the equations of motion. To determine the deriva- 
tives involved, we write the total differential of the Lagrangian : 

dL = |£ (m ik xi dx k + m lk x k dx,- ~k, k xi dx k -k ik x k dx,). 

i.k 

Since the value of the sum is obviously independent of the naming 
of the suffixes, we can interchange / and k in the first and third terms 
in the parentheses. Using the symmetry of m ik and k ik , we have 

dL = £ (m lk x k dx, -k lk x k dx,). 

Hence 

dL/dxt = £ m ik x k , 9L/9x, = - J] k lk x k . 

k k 

Lagrange's equations are therefore 

I m,kXk + £ k ik x k = 0 (/ = 1 , 2, . . . , s) ; (19.5) 

k k 

they form a set of s linear homogeneous differential equations with 
constant coefficients. 
As usual, we seek the s unknown functions x k (t) in the form 

x k = A k exp ( - icot), (1 9.6) 

where A k are some constants to be determined. Substituting (19.6) in 
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the equations (19.5) and cancelling exp(— icot), we obtain a set of 
linear homogeneous algebraic equations to be satisfied by the A k : 

£(-ay>m ik + k ik )A k = 0. (19.7) 

k 

If this system has non-zero solutions, the determinant of the coeffi- 
cients must vanish: 

\k ik -a?m ik \ = 0. (19.8) 

This is the characteristic equation and is of degree s in co 2 . In general, it 
has s different real positive roots co\ (a — 1, 2, . . ., s). The quantities 
co a thus determined are the characteristic frequencies or eigenfrequencies 
of the system. In particular cases, some of the roots may coincide ; the 
corresponding multiple eigenfrequencies are said to be degenerate. 

It is evident from physical arguments that the roots of equation 
(19.8) are real and positive. For the existence of an imaginary part of 
co would mean the presence, in the time dependence of the coordinates 
x k (19.6), and so of the velocities x k , of an exponentially decreasing 
or increasing factor. Such a factor is inadmissible, since it would lead 
to a time variation of the total energy E = U+T of the system, which 
would therefore not be conserved. 

The frequencies co a having been found, we substitute each of them 
in equations (19.7) and find the corresponding coefficients A k . Since 
these equations are homogeneous, there is an arbitrary common 
factor in the resulting values. To stress this, we shall write the coeffi- 
cients A k for any given frequency co x as A fa C a , with a specified set of 
real constants A fta and an arbitrary (complex) constant C a which does 
not depend on k. 

The particular solution of the differential equations (19.5) is there- 
fore 

x k = A^C exp ( -icoj). 

The general solution is the sum of all the particular solutions. Taking 
the real part, we write 

** = £ A *«<2«, (19.9) 
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where 

Q a = re [C* exp ( -icoj)]. (19.10) 

Thus the time variation of each coordinate of the system is a 
superposition of s simple periodic oscillations Q v Q 2 , Q s with 
arbitrary amplitudes and phases but definite frequencies. 

The question naturally arises whether the generalised coordinates 
can be chosen in such a way that each of them executes only one 
simple oscillation. The form of the general integral (19.9) points to 
the answer. For, regarding the s equations (19.9) as a set of equations 
for s unknowns Q a , we can express Q v Q 2 , Q s in terms of the 
coordinates x v x 2 , . . . , x s . The quantities Q a may therefore be regarded 
as new generalised coordinates, called normal coordinates, and they 
execute simple periodic oscillations, called normal oscillations of the 
system. 

The normal coordinates Q x are seen from their definition to satisfy 
the equations 

& + = 0. (19.11) 

This means that in normal coordinates the equations of motion become 
s independent equations. The acceleration in each normal coordinate 
depends only on the value of that coordinate, and its time dependence 
is entirely determined by the initial values of the coordinate and of the 
corresponding velocity. In other words, the normal oscillations of the 
system are completely independent. 

It is evident that the Lagrangian expressed in terms of normal 
coordinates is a sum of expressions each of which corresponds to 
oscillation in one dimension with one of the frequencies co a , i.e. it is 
of the form 

a 

where each m a is a positive constant which can take any value if the 
A Aa in (19.9) are changed by an appropriate factor. The normal 
coordinates are usually so chosen as to make m„= I. Then the 
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Lagrangian of the system is* 

£ = IE 09.12) 

If we have a system of interacting particles not in an external field, 
not all of its degrees of freedom relate to oscillations. A typical example 
is that of molecules. Besides motions in which the atoms oscillate 
about their positions of equilibrium in the molecule, the whole molecule 
can execute translational and rotational motions. 

Three degrees of freedom correspond to translational motion, and 
in general the same number to rotation, so that, of the 3n degrees of 
freedom of a molecule containing n atoms, 3n— 6 correspond to 
vibration. An exception is formed by molecules in which the atoms are 
collinear, for which there are only two rotational degrees of freedom 
(since rotation about the line of atoms is of no significance), and 
therefore 3«— 5 vibrational degrees of freedom. 

The normal vibrations of the molecule may be classified according 
to the corresponding motion of the atoms on the basis of a considera- 
tion of the symmetry of the equilibrium positions of the atoms in the 
molecule. There is a general method of doing so, based on the use 
of group theory. Here we shall consider only some elementary ex- 
amples. 

If all n atoms in a molecule lie in one plane, we can distinguish 
normal vibrations in which the atoms remain in that plane from those 
where they do not. The number of each kind is readily determined. 
Since, for motion in a plane, there are In degrees of freedom, of which 
two are translational and one rotational, the number of normal 
vibrations which leave the atoms in the plane is 2n— 3. The remaining 
(3«-6)-(2«-3) = h-3 vibrational degrees of freedom correspond 
to vibrations in which the atoms move out of the plane. 



t When there are degenerate frequencies, the choice of the normal coordinates 
is not fully determined by this condition. Since those coordinates which have the 
same co x appear in the kinetic and potential energies as sums £ Ql and £ Ql which 
are transformed in the same way, they can be linearly transformed in any manner 
which does not alter these sums of squares. 
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For a linear molecule we can distinguish longitudinal vibrations, 
which maintain the linear form, from vibrations which bring the atoms 
out of line. Since a motion of n particles in a line corresponds to n 
degrees of freedom, of which one is translational, the number of 
vibrations which leave the atoms in line is w — 1. Since the total number 
of vibrational degrees of freedom of a linear molecule is 3w— 5, there 
are In— 4 which bring the atoms out of line. These In— 4 vibrations, 
however, correspond to only n—2 different frequencies, since each 
such vibration can occur in two mutually perpendicular planes 
through the axis of the molecule. It is evident from symmetry that 
each such pair of normal vibrations have equal frequencies. 

PROBLEMS 

Problem 1. Determine the oscillations of a system with two degrees of freedom 
whose Lagrangian is L = fC^+jP 1 )— jcolix'+y^+axy (two identical one- 
dimensional systems of eigenfrequency tu 0 coupled by an interaction — axy). 

Solution. The equations of motion are X+a>lx = ay, y+o>ly = ax. The 
substitution (19.6) gives 

A x (col-a}*) = aA,, ^/cojj-co 1 ) = aA x . (1) 

The characteristic equation is (tuj— to 2 ) 1 = a 2 , whence tuf = tuj— a, tui = 
= co§+a.Forco = a> u the equations (1) give A x = ,4,, and for <u = a> 2 ,A s = — A v . 
Hence x = (Qi+Q 2 )IV2,y = (Qi-Qt)IV2, the coefficients 1/V2 resulting from 
the normalisation of the normal coordinates. 

Force « oil (weak coupling) we have oj 1 tu 0 — |a/co 0 , tu 2 «s tu 0 +|a/tu 0 . The 
variation of x and y is in this case a superposition of two oscillations with 
almost equal frequencies, i.e. beats of frequency eoj-co! = oc/eo 0 (see §18). The 
amplitude of y is a minimum when that of x is a maximum, and vice versa. 

Problem 2. Determine the small oscillations of a coplanar double pendulum 
(Fig. 1,§5). 

Solution. For small oscillations (<£i « 1, (j> s ■« 1), the Lagrangian derived in 
§5, Problem 1, becomes 

L = A(m! + mjlfrpl + {m 2 ll<i>l + m^l^^ -^(m 1 + m^/^f - < m 2 gl 2 cf>l. 
The equations of motion are 

(m 1 +m^/ 1 0 l +m I /,c£j+('Hi+'n l te0i = 0, Iifi+lSt+S^z = °- 
Substitution of (19.6) gives 

A 1 (m 1 + m^ig-lico 1 )- AtO^mJt = 0, -^^ 1 /^eo l +^^ 1 te-/ I a> , ) = 0. 



64 Small Oscillations 

The roots of the characteristic equation are 



§20 



-=,^-r, {('«i + '«2)('i+ / i)±V('"i + m 2 )V'[('"i + m 2 )(/ 1 + / 2 ) 2 -4m 1 / 1 /2]}- 



As mi -*■ 00 the frequencies tend to the values V(glh) and ViglU), corresponding 
to independent oscillations of the two pendulums. 

Problem 3. Find the path of a particle in a central field U — \kr 2 (called a 
space oscillator). 

Solution. As in any central field, the path lies in a plane, which we take as the 
xy plane. The variation of each coordinate x, y is a simple oscillation with the same 
frequency w = vW"0: x = acos(<uf+a), y = 6cos(cuf+/3), or x = a cos 4>, 
y = b cos (</>+<?) = b cos 5 cos $ — b sin 5 sin where 4> = cuf+a, 5 = /S— a. 
Solving for cos cf> and sin (/> and equating the sum of their squares to unity, we find 
the equation of the path: 

x^ v 2 2xy 

—f+L, l- cos ^ = s ' n2 s - 

a 2 o 2 aft 

This is an ellipse with its centre at the origin. When 8 = 0 ovn, the path degenerates 
to a segment of a straight line. 



§20. Damped oscillations 

So far we have implied that all motion takes place in a vacuum, or 
else that the effect of the surrounding medium on the motion may be 
neglected. In reality, when a body moves in a medium, the latter 
exerts a resistance which tends to retard the motion. The energy of 
the moving body is finally dissipated by being converted into heat. 

Motion under these conditions is no longer a purely mechanical 
process, and allowance must be made for the motion of the medium 
itself and for the internal thermal state of both the medium and the 
body. In particular, we cannot in general assert that the acceleration 
of a moving body is a function only of its coordinates and velocity at 
the instant considered; that is, there are no equations of motion 
derivable from the Lagrangian by the methods of mechanics. Thus the 
problem of the motion of a body in a medium is not one of mechanics 
alone. 

There exists, however, a class of cases where motion in a medium 
can be approximately described by including certain additional terms 
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in the mechanical equations of motion. Such cases include oscillations 
with frequencies small compared with those of the dissipative processes 
in the medium. When this condition is fulfilled we may regard the 
body as being acted on by a force of friction which depends (for a 
given homogeneous medium) only on its velocity. 

If, in addition, this velocity is sufficiently small, then the frictional 
force can be expanded in powers of the velocity. The zero-order term 
in the expansion is zero, since no friction acts on a body at rest, and 
so the first non-vanishing term is proportional to the velocity. Thus 
the generalised frictional force / fr acting on a system executing small 
oscillations in one dimension (coordinate x) may be written / fr = 
= —ccx, where a is a positive coefficient and the minus sign indicates 
that the force acts in the direction opposite to that of the velocity. 
Adding this force on the right-hand side of the equation of motion, 
we obtain 

mx = —kx—acx. (20.1) 

We divide this by m and put 

k/m = <4, x/m = 2X; (20.2) 

co 0 is the frequency of free oscillations of the system in the absence of 
friction, and X is called the damping coefficient? 
Thus the equation is 

x+2Xx+colx = 0. (20.3) 

We seek a solution x — exp (rt) and obtain for r the characteristic 
equation r 2 +2Xr+a>l = 0. The general solution of equation (20.3) is 

x = ci exp(ri?) + c 2 exp(/V), n, 2 = -X±-\/(X 2 -W). 

Two cases must be distinguished. If X < co 0 , we have two complex 
conjugate values of r. The general solution of the equation of motion 
can then be written as 

x = re {A exp [ ~Xt -iVH ~^)t]}, 



t The dimensionless product AT (where T = 2ji/co is the period) is called the 
logarithmic damping decrement. 
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where A is an arbitrary complex constant, or as 

x = a exp ( —Xt) cos (cot+a), (20.4) 

with co — -y/ (co\—X 2 ) and a and a real constants. The motion described 
by these formulae consists of damped oscillations. It may be regarded 
as being harmonic oscillations of exponentially decreasing amplitude. 
The rate of decrease of the amplitude is given by the exponent X, and 
the "frequency" co is less than that of free oscillations in the absence 
of friction. For X <sc co 0 , the difference between co and co 0 is of the 
second order of smallness. The decrease in frequency as a result of 
friction is to be expected, since friction retards motion. 

If X <sc co 0 , the amplitude of the damped oscillation is almost un- 
changed during the period In/co. It is then meaningful to consider 
the mean values (over the period) of the squared coordinates and 
velocities, neglecting the change in exp(— Xt) when taking the mean. 
These mean squares are evidently proportional to exp(— 2Xi). Hence 
the mean energy of the system decreases as 

E = E 0 exp(-2Xt), (20.5) 

where E 0 is the initial value of the energy. 

Next, let X > coo. Then the values of r are both real and negative. 
The general form of the solution is 

x = Ciexpl-fA-V^-^lO + ^expl-tA+V^-^)]'}- (20.6) 

We see that in this case, which occurs when the friction is sufficiently 
strong, the motion consists of a decrease in \x\, i.e. an asymptotic 
approach (as f — «>) to the equilibrium position. This type of motion 
is called aperiodic damping. 

Finally, in the special case where X = co 0 , the characteristic equation 
has the double root r = — X. The general solution of the differential 
equation is then 

x = (d+c 2 t)exp(-Xt). (20.7) 

This is a special case of aperiodic damping. 

For a system with more than one degree of freedom, the generalised 
frictional forces corresponding to the coordinates x t are linear func- 
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tions of the velocities, of the form 

fh.t= -£«!***. (20.8) 

k 

From purely mechanical arguments we can draw no conclusions 
concerning the symmetry properties of the coefficients cn ik as regards 
the suffixes i and k, but the methods of statistical physics make it 
possible to demonstrate that in all cases 

a ifc = x ki . (20.9) 
Hence the expressions (20.8) can be written as the derivatives 

A. i = -8F/3i, (20.10) 

of the quadratic form 

i.k 

which is called the dissipative function. 

The forces (20.10) must be added to the right-hand side of 
Lagrange's equations : 

The dissipative function itself has an important physical signific- 
ance : it gives the rate of dissipation of energy in the system. This is 
easily seen by calculating the time derivative of the mechanical energy 
of the system. We have 



dE 
dt 



= ?*'(dF[ai7]~3*") 



= - Y */ . 



Since F is a quadratic function of the velocities, Euler's theorem on 
homogeneous functions shows that the sum on the right-hand side is 
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equal to IF. Thus 

dE/dt = -2F, (20.13) 

i.e. the rate of change of the energy of the system is twice the dissipative 
function. Since dissipative processes lead to loss of energy, it follows 
that F > 0, i.e. the quadratic form (20.1 1) is positive definite. 

§21. Forced oscillations under friction 

The theory of forced oscillations under friction is entirely analogous 
to that given in §18 for oscillations without friction. Here we shall 
consider in detail the case of a periodic external force, which is of 
considerable interest. 

Adding to the right-hand side of equation (20.1) an external force 
/ cos yt and dividing by m, we obtain the equation of motion : 

x + Wc + a$x = (ffm) cos yt. (21.1) 

The solution of this equation is more conveniently found in complex 
form, and so we replace cos yt on the right by exp(— iyt): 

x+2Xx+colx = if/m) exp ( — iyt). 

We seek a particular integral in the form x = B exp( — iyt), obtaining 
for B the value 

B = f/m(c4 -f -lay). (21.2) 
Writing B = b exp(— id), we have 

b = f/m Vl« -y 2 f+ 4AV ], tan d - 2ly/(y* -tag). (21.3) 

Finally, taking the real part of the expression B exp (—iyt) = 
= b exp [— i(yt+S)], we find the particular integral of equation (21.1); 
adding to this the general solution of that equation with zero on the 
right-hand side (and taking for definiteness the case co 0 >■ A), we have 

x = aexp(— Xt) cos (cot+ a) +b cos (yt+ d). (21.4) 
The first term decreases exponentially with time, so that, after a 
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sufficient time, only the second term remains : 

x = bcos(yt+8). (21.5) 

The expression (21.3) for the amplitude b of the forced oscillation 
increases as y approaches co 0 , but does not become infinite as it does 
in resonance without friction. For a given amplitude / of the force, the 
amplitude of the oscillations is greatest when y = ^/O^o - 2A 2 ) ; for 
A <k co 0 , this differs from co 0 only by a quantity of the second order of 
smallness. 

Let us consider the range near resonance, putting y = co 0 + e with 
e small, and suppose also that A «: co 0 . Then we can approximately 
put, in (21.2), y 2 —a>l = (y+co 0 )(y—co 0 ) % 2co 0 e, lily =s 2/Aco 0 , so 
that 

B= -f/2m(e+iX)(o 0 (21.6) 

or 

b =f/2m(OoV(e 2 +X l ), tan 6 = X/e. (21.7) 
A property of the phase difference d between the oscillation and the 
external force is that it is always negative, i.e. the oscillation "lags 
behind" the force. Far from resonance on the side y < co 0 , £ — 0; 
on the side y > co 0 , # —it. The change of b from zero to — n takes 
place in a frequency range near co 0 which is narrow (of the order of 
A in width); d passes through — -|« when y = a> 0 . In the absence of 
friction, the phase of the forced oscillation changes discontinuously 
byn at y = co 0 (the second term in (18.4) changes sign); when friction 
is allowed for, this discontinuity is smoothed out. 

In steady motion, when the system executes the forced oscillations 
given by (21.5), its energy remains unchanged. Energy is continually 
absorbed by the system from the source of the external force and 
dissipated by friction. Let I(y) be the mean amount of energy absorbed 
per unit time, which depends on the frequency of the external force. 
By (20.13) we have I(y) = 2F, where F is the average value (over the 
period of oscillation) of the dissipative function. For motion in one 
dimension, the expression (20.11) for the dissipative function becomes 
F = ±<zx 2 = Aw* 2 . Substituting (21.5), we have 

F= Xmby sin 2 (yt+S). 
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The time average of the squared sine is \, so that 

I(y) = XmbY- (21.8) 

Near resonance we have, on substituting the amplitude of the 
oscillation from (21.7), 

I(e)=pX/4m(e*+X 2 ). (21.9) 

This is called a dispersion-type frequency dependence of the absorption. 
The half-width of the resonance curve (Fig. 18) is the value of |e| for 
which 1(e) is half its maximum value (e = 0). It is evident from (21.9) 




that in the present case the half-width is just the damping coefficient X. 
The height of the maximum is 7(0) = P/4mX, and is inversely pro- 
portional to X. Thus, when the damping coefficient decreases, the 
resonance curve becomes more peaked. The area under the curve, 
however, remains unchanged. This area is given by the integral 

oo oo 

jl(y)dy= jl(e)de. 

0 — »o 

Since 1(e) diminishes rapidly with increasing | e |, the region where | e | is 
large is of no importance, and the lower limit may be replaced by 
— «> , and 1(e) taken to have the form given by (21 .9). Then we have 



oo oo 

f PX C de 7if* 
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§22. Parametric resonance 

There exist oscillatory systems which are not closed, but in which 
the external action amounts only to a time variation of the para- 
meters.* 

The parameters of a one-dimensional system are the coefficients m 
and k in the Lagrangian (17.3). If these are functions of time, the 
equation of motion is 

4-(mx)+kx = 0. (22.1) 
at 

We introduce instead of / a new independent variable x such that 
dx = dtjm{t) ; this reduces the equation to 

d 2 x/dr 2 +mkx = 0. 

There is therefore no loss of generality in considering an equation of 
motion of the form 

d 2 x/dt 2 +co\t)x = 0 (22.2) 

obtained from (22. 1) if m = constant. 

The form of the function co(t) is given by the conditions of the 
problem. Let us assume that this function is periodic with some 
frequency y and period T = 2n/y. This means that co(t+T) — eo(t), 
and so the equation (22.2) is invariant under the transformation 
/ -► t+T. Hence, if x(t) is a solution of the equation, so is xft+T). 
That is, if xi(t) and x 2 (t) are two independent integrals of equation 
(22.2), they must be transformed into linear combinations of them- 
selves when / is replaced by t+T. It is possible J to choose Xi and x 2 
in such a way that, when / -* t+T, they are simply multiplied by 
constants: Xi(t+T) = fiiXi(f), x 2 (t+T) = /^^W- The most general 
functions having this property are 

Xl (t) = /ifn^t), x 2 (t) = ,xfii 2 (t), (22.3) 

where i7i(/), II 2 (t) are purely periodic functions of time with period T. 

t A simple example is that of a pendulum whose point of support executes a 
given periodic motion in a vertical direction (see Problem). 
t Provided that the constants /(, and /( 2 are not equal. 
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The constants /ai and fi2 in these functions must be related in a 
certain way. Multiplying the equations xi+co 2 (t)xi = 0, X2+a> 2 (t)x2 = 
= 0 by x 2 and x\ respectively and subtracting, we have xix 2 — x 2 xi = 
= d(xiX2—xix 2 )ldt = 0, or 

x\X2—x\Xi = constant. (22.4) 

For any functions x\(t), x 2 {t) of the form (22.3), the expression on the 
left-hand side of (22.4) is multiplied by fj,^i 2 when t is replaced by 
t+T. Hence it is clear that, if equation (22.4) is to hold, we must 
have 

= 1. (22.5) 

Further information about the constants fi 2 can be obtained 
from the fact that the coefficients in equation (22.2) are real. If x(t) 
is any integral of such an equation, then the complex conjugate 
function x*(t) must also be an integral. Hence it follows that {ii, p 2 
must be the same as /i*, fi\, i.e. either ^ = ^ or ^ and fi 2 are 
both real. In the former case, (22.5) gives ^ = 1//^, i.e. I//J 2 = 
= \/j,2 | 2 = 1 : the constants fii and [i2 are of modulus unity. 

In the other case, two independent integrals of equation (22.2) are 

Xl (t) = ^n^t), x 2 (t) = r" T n2{t\ (22.6) 

with a positive or negative real value of fi ^ 1). One of these 
functions (*i or x 2 according as | \ > 1 or | fi | < 1) increases exponen- 
tially with time. This means that the system at rest in equilibrium 
(x = 0) is unstable: any deviation from this state, however small, is 
sufficient to lead to a rapidly increasing displacement x. This is called 
parametric resonance. 

It should be noticed that, when the initial values of x and x are 
exactly zero, they remain zero, unlike what happens in ordinary 
resonance (§18), in which the displacement increases with time (pro- 
portionally to /) even from initial values of zero. 

Let us determine the conditions for parametric resonance to occur in 
the important case where the function co(t) differs only slightly from a 
constant value co 0 and is a simple periodic function: 

0,2(0 = (ug(l+ h cos yt), (22.7) 
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where the constant h <sc 1 ; we shall suppose h positive, as may always 
be done by suitably choosing the origin of time. As we shall see below, 
parametric resonance is strongest if the frequency of the function 
co(t) is nearly twice <w 0 . Hence we put y = 2co 0 +e, where e <k <w 0 . 
The solution of the equation of motion 

x+ool[l+hcos(2to 0 + e)t]x = 0 (22.8) 

may be sought in the form 

x = a(t)cos(a>o+\e)t+b(t)sm((o 0 + \e)t, (22.9) 

where a{t) and b(t) are functions of time which vary slowly in compari- 
son with the trigonometrical factors. This form of solution is, of 
course, not exact. In reality, the function x(t) also involves terms with 
frequencies which differ from <w 0 +ye by integral multiples of 2<w 0 +f ; 
these terms are, however, of a higher order of smallness with respect 
to h, and may be neglected in a first approximation. 

The values of y which divide the regions of instability and stability 
correspond to /x = 1 in (22.6) and to time-independent coefficients a 
and b in (22.9). Thus the determination of the limits of the resonance 
region amounts to finding the values of y (or, equivalently, those of e) 
for which the equation of motion is satisfied (with the necessary 
accuracy) by the solution (22.9) with constant a and b. 

We substitute (22.9) in (22.8). The products of trigonometrical 
functions may be replaced by sums : 

cos(<u 0 +^e)f cos(2<u 0 + e)f = \ cos 3{oi 0 -\-\e)t-\-\ cos (<u 0 + ye)f, 

etc., and in accordance with what was said above we omit terms with 
frequency 3(<u 0 +ie). The result is 

b(e + \hcoo) sin (coo + ^e)t+a(e—Yhco 0 ) cos (a>o + Y£)t = 0. 

If this equation is to be satisfied, the coefficients of the sine and cosine 
must both be zero. Thus either e = — \hco 0 and a = 0, or e = ^hco 0 
and b = 0. These values of e give the limits of the region of parametric 
resonance. Thus parametric resonance occurs in the range 

-\ho) 0 < e < |7ttUo (22.10) 

on either side of the frequency 2co 0 . 
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Parametric resonance also occurs when the frequency y is close to 
any value 2<do/h with n integral. The width of the resonance range 
decreases rapidly with increasing n, however, namely as ft". 

PROBLEM 

Find the conditions for parametric resonance in small oscillations of a simple 
pendulum whose point of support oscillates vertically. 

Solution. The Lagrangian derived in §5, Problem 2, gives for small oscillations 
(cjxa I) the equation of motion <£+a>o[l+(4a//) cos(2tu 0 +e)/]</> = 0, where 
wo = g/l- Hence we see that the parameter h is here represented by 4a//. The 
condition (22.10) becomes |e| < la V(«7/ s ). 

§23. Anharmonic oscillations 

The whole of the theory of small oscillations discussed above is 
based on the expansion of the potential and kinetic energies of the 
system in terms of the coordinates and velocities, retaining only the 
second-order terms. The equations of motion are then linear, and in 
this approximation we speak of linear oscillations. Although such an 
expansion is entirely legitimate when the amplitude of the oscillations 
is sufficiently small, in higher approximations (called anharmonic or 
non-linear oscillations) some minor but qualitatively different prop- 
erties of the motion appear. 

Let us consider the expansion of the Lagrangian as far as the third- 
order terms. In the potential energy there appear terms of degree three 
in the coordinates x t , and in the kinetic energy terms containing 
products of velocities and coordinates, of the form ipc^. This 
difference from the previous expression (19.3) is due to the retention 
of terms linear in x in the expansion of the functions a ik {q). Thus the 
Lagrangian is of the form 

L = \Y, (m i kXiX k -ki k x i x k )+ 

i.k 

+ 1 £ n ik ,XiX k x,-\ Y, l MXiX k xi, (23.1) 
where n ikl , l ik , are further constant coefficients. 



§23 Anharmonic oscillations 75 

If we change from arbitrary coordinates x, to the normal coordinates 
of the linear approximation, then, because this transformation is 

linear, the third and fourth sums in (23.1) become similar sums with 
and Q a in place of the coordinates x t and the velocities x t . Denoting 

the coefficients in these new sums by X x p y and we have the Lagran- 

gian in the form 

<x a,/9, y a, /J, y 

(23.2) 

We shall not pause to write out in their entirety the equations of 
motion derived from this Lagrangian. The important feature of these 
equations is that they are of the form 

fi. + «£G.=/.(fi, fi.fi), (23.3) 

where / a are homogeneous functions, of degree two, of the coordinates 
Q and their time derivatives. 

Using the method of successive approximations, we seek a solution 
of these equations in the form 

fi. - fif'+GS 0 , (23.4) 

where fi£ 2) <sc Q&\ and the fi^ 1) satisfy the "unperturbed" equations 
Q^P+colQp = 0, i.e. they are ordinary harmonic oscillations: 

fiW = a a cos (coj+xj. (23.5) 

Retaining only the second-order terms on the right-hand side of 
(23.3) in the next approximation, we have for the fi£ 2) the equations 

fii 2) + « 2 fii 2) = /«(fi (1) , fi (1) , fi (1) ), (23.6) 

where (23.5) is to be substituted on the right. This gives a set of in- 
homogeneous linear differential equations, in which the right-hand 
sides can be represented as sums of simple periodic functions. For 
example, 

= a a a 0 cos (to J +<*„) cos (oi p t+x p ) 

= i^{cOS [((O a + (Op)t + X a + Xp] + COS [(C0*-C0p)t + X x -Xp]}. 
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Thus the right-hand sides of equations (23.6) contain terms corre- 
sponding to oscillations whose frequencies are the sums and differences 
of the eigenfrequencies of the system. The solution of these equations 
must be sought in a form involving similar periodic factors, and so we 
conclude that, in the second approximation, additional oscillations 
with frequencies 

co a ± co fi , (23.7) 

including the double frequencies 2<u a and the frequency zero (corre- 
sponding to a constant displacement), are superposed on the normal 
oscillations of the system. These are called combination frequencies. 
The corresponding amplitudes are proportional to the products ajip 
(or the squares a^) of the corresponding normal amplitudes. 

In higher approximations, when further terms are included in the 
expansion of the Lagrangian, combination frequencies occur which 
are the sums and differences of more than two co^; and a further 
phenomenon also appears. In the third approximation, the combina- 
tion frequencies include some which coincide with the original 
frequencies <u a ( = co x +co fi —co fi ). When the method described above is 
used, the right-hand sides of the equations of motion therefore include 
resonance terms, which lead to terms in the solution whose amplitude 
increases with time. It is physically evident, however, that the magnitude 
of the oscillations cannot increase of itself in a closed system with no 
external source of energy. 

In reality, the fundamental frequencies <u a in higher approximations 
are not equal to their "unperturbed" values <u< 0) which appear in the 
quadratic expression for the potential energy. The increasing terms in 
the solution arise from an expansion of the type 

cos (<u< 0) + Atujf « cos co^t-tAco a sin <u< 0) /, 

which is obviously not legitimate when t is sufficiently large. 
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§24. Angular velocity 

A rigid body may be defined in mechanics as a system of particles 
such that the distances between the particles do not vary. This condi- 
tion can, of course, be satisfied only approximately by systems which 
actually exist in nature. The majority of solid bodies, however, change 
so little in shape and size under ordinary conditions that these changes 
may be entirely neglected in considering the laws of motion of the body 
as a whole. 

In what follows, we shall often simplify the derivations by regarding 
a rigid body as a discrete set of particles, but this in no way invalidates 
the assertion that solid bodies may usually be regarded in mechanics 
as continuous, and their internal structure disregarded. The passage 
from the formulae which involve a summation over discrete particles 
to those for a continuous body is effected by simply replacing the mass 
of each particle by the mass q dV contained in a volume element dV 
(p being the density) and the summation by an integration over the 
volume of the body. 

To describe the motion of a rigid body, we use two systems of 
coordinates : a "fixed" (i.e. inertial) system XYZ, and a moving system 
xi = x, x 2 = y, x 3 = z which is supposed to be rigidly fixed in the 
body and to participate in its motion. The origin of the moving system 
may conveniently be taken to coincide with the centre of mass of the 
body. 
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The position of the body with respect to the fixed system of coordi- 
nates is completely determined if the position of the moving system is 
specified. Let the origin O of the moving system have the position R 0 
(Fig. 19). The orientation of the axes of that system relative to the 
fixed system is given by three independent angles, which together with 
the three components of the vector Ro make six coordinates. Thus a 
rigid body is a mechanical system with six degrees of freedom. 

Let us consider an arbitrary infinitesimal displacement of a rigid 
body. It can be represented as the sum of two parts. One of these is an 
infinitesimal translation of the body, whereby the centre of mass moves 
to its final position, but the orientation of the axes of the moving system 
of coordinates is unchanged. The other is an infinitesimal rotation 
about the centre of mass, whereby the remainder of the body moves to 
its final position. 

Let r be the position of an arbitrary point P in a rigid body in the 
moving system, and R the position of the same point in the fixed system 
(Fig. 19). Then the infinitesimal displacement dR of P consists of a 
displacement dR 0 , equal to that of the centre of mass, and a displace- 
ment d<J>Xr relative to the centre of mass resulting from a rotation 
through an infinitesimal angle d<f> (see (9.1)): dR = dR 0 +d<J)Xr. 
Dividing this equation by the time dt during which the displacement 
occurs, and putting 

dR/d< = v, dRo/d/ = V, <fy/d/ = SI, (24.1) 
we obtain the relation 

v = V+S2Xr. (24.2) 

The vector V is the velocity of the centre of mass of the body, and 
is also the translational velocity of the body. The vector SI is called 
the angular velocity of the rotation of the body; its direction, like that 
of d<J>, is along the axis of rotation. Thus the velocity v of any point 
in the body relative to the fixed system of coordinates can be expressed 
in terms of the translational velocity of the body and its angular 
velocity of rotation. 

It should be emphasised that, in deriving formula (24.2), no use 
has been made of the fact that the origin is located at the centre of 
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z 




Fia. 19 



mass. The advantages of this choice of origin will become evident 
when we come to calculate the energy of the moving body. 

Let us now assume that the system of coordinates fixed in the body is 
such that its origin is not at the centre of mass O, but at some point O' 
at a distance a from O. Let the velocity of O' be V, and the angular 
velocity of the new system of coordinates be Si'. We again consider 
some point P in the body, and denote by r' its position with respect 
to O'. Then r = r' + a, and substitution in (24.2) gives v = V+S2X 
Xa+£2Xr'. The definition of V and Si' shows that v = \'+Si' Xr'. 
Hence it follows that 



The second of these equations is very important. We see that the 
angular velocity of rotation, at any instant, of a system of coordinates 
fixed in the body is independent of the particular system chosen. All 
such systems rotate with angular velocities SI which are equal in 
magnitude and parallel in direction. This enables us to call SI the 
angular velocity of the body. The velocity of the translational motion, 
however, does not have this "absolute" property. 

It is seen from the first formula (24.3) that, if V and SI are, at any 
given instant, perpendicular for some choice of the origin O, then V 
and SI' are perpendicular for any other origin O'. Formula (24.2) 
shows that in this case the velocities v of all points in the body are 



V = 



V+£2Xa, Si' = Si. 



(24.3) 
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in a plane perpendicular to SI. It is then always possible* to choose 
an origin O' whose velocity V is zero, so that the motion of the body 
at the instant considered is a pure rotation about an axis through O'. 
This axis is called the instantaneous axis of rotation.% 

In what follows we shall always suppose that the origin of the mov- 
ing system is taken to be at the centre of mass of the body, and so the 
axis of rotation passes through the centre of mass. In general both the 
magnitude and the direction of Si vary during the motion. 

§25. The inertia tensor 

To calculate the kinetic energy of a rigid body, we may consider it 
as a discrete system of particles and put T = Z\mv 2 , where the sum- 
mation is taken over all the particles in the body. Here, and in what 
follows, we simplify the notation by omitting the suffix which denumer- 
ates the particles. 

Substitution of (24.2) gives 

T = £ \-m(y+SlXrf = £iroKH£mV-flXr+Xi™(flXr) a . 

The velocities V and SI are the same for every point in the body. In the 
first term, therefore, \Y* can be taken outside the summation sign, 
and Em is just the mass of the body, which we denote by p. In the 
second term we put Zm\ -SiXr = 2mr-VXS2 = YxSi-Zmr. Since 
we take the origin of the moving system to be at the centre of mass, 
this term is zero, because Emr = 0. Finally, in the third term we 
expand the squared vector product. The result is 

T= i^+iXmpVMfl.rn (25.1) 

Thus the kinetic energy of a rigid body can be written as the sum of 
two parts. The first term in (25. 1) is the kinetic energy of the translation- 



t O' may, of course, lie outside the body. 

% In the general case where V and £2 are not perpendicular, the origin may be 
chosen so as to make V and £2 parallel, i.e. so that the motion consists (at the instant 
in question) of a rotation about some axis together with a translation along that 
axis. 
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al motion, and is of the same form as if the whole mass of the body 
were concentrated at the centre of mass. The second term is the kinetic 
energy of the rotation with angular velocity SI about an axis passing 
through the centre of mass. It should be emphasised that this division 
of the kinetic energy into two parts is possible only because the origin 
of the coordinate system fixed in the body has been taken to be at its 
centre of mass. 

We may rewrite the kinetic energy of rotation in tensor form, i.e. 
in terms of the components t x t and £2, of the vectors r and SI. We have 

r„, = iXi»KW-fii*A*ft) 

= I^A E m (rf&ik -x t x k ). 

Here we have used the identity Q ( . = S jk Q k , where d lk is the unit 
tensor, whose components are unity for i = k and zero for / k. In 
terms of the tensor 

4 = E ffl (^ -xix k ) (25.2) 

we have finally the following expression for the kinetic energy of a 
rigid body : 

T= l/iFHlZ/AA. (25.3) 

The Lagrangian for a rigid body is obtained from (25.3) by subtracting 
the potential energy : 

L = -\fiV*+ f /, AA - U. (25.4) 

The potential energy is in general a function of the six variables which 
define the position of the rigid body, e.g. the three coordinates X, Y, Z 
of the centre of mass and the three angles which specify the relative 
orientation of the moving and fixed coordinate axes. 



t The letters /, k, I are tensor suffixes and take the values 1, 2, 3. The summation 
rule will always be used, i.e. summation signs are omitted, but summation over the 
values 1, 2, 3 is implied whenever a suffix occurs twice in any expression. Such a 
suffix is called a dummy suffix. For example, A,B, = A-B, A} = A X A X = A 2 , etc. 
It is obvious that dummy suffixes can be replaced by any other like suffixes, except 
ones which already appear elsewhere in the expression concerned. 
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The tensor I jk is called the inertia tensor of the body. It is symmet- 
rical, i.e. 

hk = hi, (25.5) 

as is evident from the definition (25.2). For clarity, we may give its 
components explicitly : 



hk = 



£ m(y 2 + z 2 ) — £ mxy — £ mxz 
— £ myx £ m(x 2 + z 2 ) /mj>z 
. — £ mzx —J^mzy £ w^+j 2 ). 



(25.6) 



The components V^, 7^,, /„ are called the moments of inertia about the 
corresponding axes. 

The inertia tensor is evidently additive: the moments of inertia of a 
body are the sums of those of its parts. 

If the body is regarded as continuous, with density q, the sum in 
the definition (25.2) becomes an integral over the volume of the body : 

hk = j Q(xf8i k - x t x k ) d V. (25.7) 

Like any symmetrical tensor of rank two, the inertia tensor can be 
reduced to diagonal form by an appropriate choice of the directions 
of the axes xi, x 2 , x 3 . These directions are called the principal axes of 
inertia, and the corresponding values of the diagonal components of 
the tensor are called the principal moments of inertia; we shall denote 
them by h, h, h. When the axes x u x 2 , x 3 are so chosen, the kinetic 
energy of rotation takes the very simple form 

T [0t = KAQH/^I+M- (25.8) 

None of the three principal moments of inertia can exceed the sum 
of the other two. For instance, 

/! + /, = X m{x\ + x 2 + 2x 2 ) ^ X m(xl +x 2 ) - 7 3 . (25.9) 

A body whose three principal moments of inertia are all different 
is called an asymmetrical top. If two are equal (h = h ^ h), we have 
a symmetrical top. In this case the direction of one of the principal 
axes in the x\Xi plane may be chosen arbitrarily. If all three principal 
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moments of inertia are equal, the body is called a spherical top, and 
the three axes of inertia may be chosen arbitrarily as any three mutually 
perpendicular axes. 

The determination of the principal axes of inertia is much simplified 
if the body is symmetrical, for it is clear that the position of the centre 
of mass and the directions of the principal axes must have the same 
symmetry as the body. For example, if the body has a plane of sym- 
metry, the centre of mass must lie in that plane, which also contains 
two of the principal axes of inertia, while the third is perpendicular 
to the plane. An obvious case of this kind is a coplanar system of 
particles. Here there is a simple relation between the three principal 
moments of inertia. If the plane of the system is taken as the x\Xi 
plane, then x 3 = 0 for every particle, and so I t = Emx\, I 2 = Emx\, 
I 3 = Zm(xl+xl), whence 

h = h+h. (25.10) 

If a body has an axis of symmetry of any order, the centre of mass 
must lie on that axis, which is also one of the principal axes of inertia, 
while the other two are perpendicular to it. If the axis is of order 
higher than the second, the body is a symmetrical top. For any prin- 
cipal axis perpendicular to the axis of symmetry can be turned through 
an angle different from 180° about the latter, i.e. the choice of the 
perpendicular axe's is not unique, and this can happen only if the body 
is a symmetrical top. 

A particular case here is a collinear system of particles. If the line 
of the system is taken as the x 3 axis, then xi = x 2 = 0 for every 
particle, and so two of the principal moments of inertia are equal 
and the third is zero : 

4 = 4 = £»«§. 4 = 0. (25.11) 

Such a system is called a rotator. The characteristic property which 
distinguishes a rotator from other bodies is that it has only two, not 
three, rotational degrees of freedom, corresponding to rotations 
about the x\ and x 2 axes: it is clearly meaningless to speak of the rota- 
tion of a straight line about itself. 
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Finally, we may note one further result concerning the calculation 
of the inertia tensor. Although this tensor has been defined with 
respect to a system of coordinates whose origin is at the centre of 
mass (as is necessary if the fundamental formula (25.3) is to be valid), 
it may sometimes be more conveniently found by first calculating a 
similar tensor l' ik = £m(x',' z d !k — x\x' k ), defined with respect to some 
other origin O'. If the distance 00' is represented by a vector a, then 
r = r'+a, x, = x.+a,; since, by the definition of O, Emr = 0, we 
have 

I' ik = I ik + ti(a*b ik -a iak ). (25.12) 
Using this formula, we can easily calculate I ik if l' ik is known. 

PROBLEMS 

Problem 1. Determine the principal moments of inertia for the following types 
of molecule, regarded as systems of particles at fixed distances apart : (a) a molecule 
of three collinear atoms, (b) a triatomic molecule which is an isosceles triangle 
(Fig. 20). 




Fig. 20 

Solution, (a) 

h = h = — (m 1 OT 2 /fj + OT 1 OT,/fs + OT 2 OT s /|,), 1 3 = 0, 

r 

where m, is the mass of the ath atom and the distance between the ath and 6th 
atoms. 

For a diatomic molecule the result is obvious : it is the product of the reduced 
mass of the two atoms and the square of the distance between them: I l — I% = 

(b) The centre of mass is on the axis of symmetry of the triangle, at a distance 
m 2 h/fi from its base (h being the height of the triangle). The moments of inertia are 
li = 2mim s ft i /fi, I s = j-miO*, I» = h + h- 
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Problem 2. Determine the principal moments of inertia for the following 
homogeneous bodies: (a) a thin rod of length /, (b) a sphere of radius R, (c) a 
circular cylinder of radius R and height h, (d) a rectangular parallelepiped of sides 
a, b, and c, (e) an ellipsoid of semiaxes a, b, c. 

Solution, (a) /j = / 2 = * 2 fiP, I 3 = 0 (we neglect the thickness of the rod). 

(b) Ii = / 2 = / 3 = %fiR 2 (found by calculating the sum It + I^+Is = 2g f r 2 dV). 

(c) /i = / 2 = }/it(/{ 2 + iA 2 ), / 3 = 1-jU/? 2 (where the x 3 axis is along the axis of the 
cylinder). 

(d) /j = ^fi(b 2 + c 2 ), 1 a = ^a'+c ! ), / 3 = ^/4.a 2 + b 2 ) (where the axes x l3 
x 2 , x 3 are along the sides a, b, c respectively). 

(e) The centre of mass is at the centre of the ellipsoid, and the principal axes of 
inertia are along the axes of the ellipsoid. The integration over the volume of the 
ellipsoid can be reduced to one over a sphere by the transformation x = aS, 
y = br), z = cC, which converts the equation of the surface of the ellipsoid x 2 /a 2 + 
+y 2 lb 2 + z 2 /c 2 = 1 into that of the unit sphere { 2 + i? 2 + £ 2 = 1. 

For example, the moment of inertia about the x axis is 

h = Q $$$(y 2 + z*)dxdydz 

= g abc J* J* J* (&V+c 2 C 8 ) df di? df 
= i a 6c/'(6 2 +c 2 ), 

where /' is the moment of inertia of a sphere of unit radius. Since the volume of the 
ellipsoid is 47ia6c/3, we find the moments of inertia /j = \^,{b 2 +c 2 ), I 2 = ~[i(a 2 + c 2 ), 
h = lMa 2 +6 2 ). 

Problem 3. Determine the frequency of small oscillations of a compound 
pendulum (a rigid body swinging about a fixed horizontal axis in a gravitational 
field). 

Solution. Let / be the distance between the centre of mass of the pendulum and 
the axis about which it rotates, and a, /S, y the angles between the principal axes of 
inertia and the axis of rotation. We take as the variable coordinate the angle <p 
between the vertical and a line through the centre of mass perpendicular to the 
axis of rotation. The velocity of the centre of mass is V = /</>, and the components 
of the angular velocity along the principal axes of inertia are <j> cos a, <j> cos /S, 
<j> cos y. Assuming the angle <p to be small, we find the potential energy U = /igl(l - 
— cos </>) rs ifiglcj) 2 . The Lagrangian is therefore 

L = \nl 2 4> 2 + Wi cos 2 « + h cos 2 |S+/ 3 cos 2 y)^-\ngl^. 

The frequency of the oscillations is consequently 

co 2 = figlKfiP + It cos 2 a + / 2 cos 2 y?+/ 3 cos 2 y). 

Problem 4. Find the kinetic energy of the system shown in Fig. 21 : OA and AB 
are thin uniform rods of length / hinged together at A. The rod OA rotates (in the 
plane of the diagram) about O, while the end B of the rod AB slides along Ox. 
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Fig. 21 

Solution. The velocity ofthe centre of mass of the rod OA(y/hich is at the middle 
of the rod) is |Y$>, where is the angle AOB. The kinetic energy of the rod OA is 
therefore T x = |/i/ 2 <)!> 2 +|7<£ 2 , where fi is the mass of each rod. 

The Cartesian coordinates of the centre of mass of the rod AB are X = 
f / cos <£, Y = ~l sin 4>. Since the angular velocity of rotation of this rod is also 
its kinetic energy is T 2 = \n(X*+ Y 2 )+\l4> 2 = ^/ 2 (1 + 8 sin 2 <£)<£ 2 +i/<£ 2 . The total 
kinetic energy of this system is therefore T = J/t/ 2 (l + 3 sin 2 4>)(j> 2 , since / = ^fil 2 
(see Problem 2(a)). 

Problem 5. Find the kinetic energy of a cylinder of radius R rolling on a plane, 
if the mass of the cylinder is so distributed that one of the principal axes of inertia 
is parallel to the axis of the cylinder and at a distance a from it, and the moment 
of inertia about that principal axis is /. 

Solution. Let be the angle between the vertical and a line from the centre of 
mass perpendicular to the axis of the cylinder (Fig. 22). The motion of the cylinder 
at any instant may be regarded as a pure rotation about an instantaneous axis which 
coincides with the line where the cylinder touches the plane. The angular velocity 
of this rotation is <£, since the angular velocity of rotation about all parallel axes is the 
same. The centre of mass is at a distance y/(a i +R 2 -2aR cos <£) from the instanta- 




Fig. 22 
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neous axis, and its velocity is therefore V = <j> V(a 2 +^ 2 -2a^cos0). The total 
kinetic energy is 

T = }n(a 2 + R 2 - 2aR cos 0)0 2 + {/9b 2 . 

Problem 6. Find the kinetic energy of a homogeneous cylinder of radius a rolling 
inside a cylindrical surface of radius R (Fig. 23). 




Fig. 23 



Solution. We use the angle between the vertical and the line joining the 
centres of the cylinders. The centre of mass of the rolling cylinder is on the axis, 
and its velocity is V = <f>(R-a). We can calculate the angular velocity as that of a 
pure rotation about an instantaneous axis which coincides with the line of contact 
of the cylinders; it is O = Via = cj>(R-a)la. If /, is the moment of inertia about 
the axis of the cylinder, then 

T= iM^-a) 2 0 2 +W-a) 2 0 2 /a 2 = ifi(R-a)*<j>\ 
/ a being given by Problem 2(c). 

§26. Angular momentum of a rigid body 

The value of the angular momentum of a system depends, as we 
know, on the point with respect to which it is defined. In the mechanics 
of a rigid body, the most appropriate point to choose for this purpose 
is the origin of the moving system of coordinates, i.e. the centre of 
mass of the body, and in what follows we shall denote by M the 
angular momentum so defined. 

According to formula (9.6), when the origin is taken at the centre 
of mass of the body, the angular momentum M is equal to the "intrin- 
sic" angular momentum resulting from the motion relative to the 
centre of mass. In the definition M = 27mrXv we therefore replace 
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vbyflXr: 

M = X>rX(flXr) = £ m[r z Sl -r(r-fl)], 

or, in tensor notation, 

M i = E m(rf&i- x i x A) = &kY, m ( x 'i ?> ik-XiXk)- 

Finally, using the definition (25.2) of the inertia tensor, we have 

Mi = I ik n k . (26.1) 

If the axes xi, x 2 , x 3 are the same as the principal axes of inertia, 
formula (26.1) gives 

M 1 = I 1 tl 1 , M 2 = / 2 Q 2 , M 3 = / 3 Q 3 . (26.2) 

In particular, for a spherical top, where all three principal moments 
of inertia are equal, we have simply 

M = ISl, (26.3) 

i.e. the angular momentum vector is proportional to, and in the same 
direction as, the angular velocity vector. For an arbitrary body, how- 
ever, the vector M is not in general in the same direction as SI ; this 
happens only when the body is rotating about one of its principal 
axes of inertia. 

Let us consider a rigid body moving freely, i.e. not subject to any 
external forces. We suppose that any uniform translational motion, 
which is of no interest, is removed, leaving a free rotation of the body. 

As in any closed system, the angular momentum of the freely 
rotating body is constant. For a spherical top the condition M = con- 
stant gives SI = constant; that is, the most general free rotation of a 
spherical top is a uniform rotation about an axis fixed in space. 

The case of a rotator is equally simple. Here also M = ISl, and the 
vector SI is perpendicular to the axis of the rotator. Hence a free 
rotation of a rotator is a uniform rotation in one plane about an axis 
perpendicular to that plane. 

The law of conservation of angular momentum also suffices to 
determine the more complex free rotation of a symmetrical top. Using 
the fact that the principal axes of inertia xi, Xi (perpendicular to the 
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axis of symmetry (x 3 ) of the top) may be chosen arbitrarily, we take 
the x 2 axis perpendicular to the plane containing the constant vector 
M and the instantaneous position of the x 3 axis. Then M 2 = 0, and 
formulae (26.2) show that i2 2 = 0. This means that the directions of 
M, SI and the axis of the top are at every instant in one plane (Fig. 24). 

M 




Fig. 24 



Hence, in turn, it follows that the velocity v = SI X r of every point on 
the axis of the top is at every instant perpendicular to that plane. That 
is, the axis of the top rotates uniformly (see below) about the direction 
of M, describing a circular cone. This is called regular precession of 
the top. At the same time the top rotates uniformly about its own axis. 

The angular velocities of these two rotations can easily be expressed 
in terms of the given angular momentum M and the angle 6 between 
the axis of the top and the direction of M. The angular velocity of 
the top about its own axis is just the component Q 3 of the vector SI 
along the axis : 

Cl 3 = M 3 \I 3 = (M/h) cos 6. (26.4) 

To determine the rate of precession i2 pr , the vector SI must be resolved 
into components along x 3 and along M. The first of these gives no 
displacement of the axis of the top, and the second component is 
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therefore the required angular velocity of precession. Fig. 24 shows 
that Q pr sin 6 = Qi, and, since Qi = Mi/h = (M/h) sin 6, we have 

£2 pr = M/h. (26.5) 

§27. The equations of motion of a rigid body 

Since a rigid body has, in general, six degrees of freedom, the general 
equations of motion must be six in number. They can be put in a 
form which gives the time derivatives of two vectors, the momentum 
and the angular momentum of the body. 

The first equation is obtained by simply summing the equations 
p — f for each particle in the body, p being the momentum of the 
particle and f the force acting on it. In terms of the total momentum 
of the body P = i^p = fi\ and total force acting on it F = Hi, we 
have 

dP/d* = F. (27.1) 

Although F has been defined as the sum of all the forces f acting 
on the various particles, including the forces due to other particles, 
F actually includes only external forces: the forces of interaction 
between the particles composing the body must cancel out, since if there 
are no external forces the momentum of the body, like that of any 
closed system, must be conserved, i.e. we must have F = 0. 

If U is the potential energy of a rigid body in an external field, the 
force F is obtained by differentiating U with respect to the coordinates 
of the centre of mass of the body : 

F = -9£//8R 0 . (27.2) 

For, when the body undergoes a translation through a distance <5R 0 , 
the position vector R of every point in the body changes by 6R 0 , and 
so the change in the potential energy is 

dU = £(3t//8R)-<5R = <5R 0 -Xd£//dR = -3R 0 -£f = -F-3R 0 . 

Let us now derive the second equation of motion, which gives the 
time derivative of the angular momentum M. To simplify the deriva- 
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tion, it is convenient to choose the "fixed" (inertial) frame of reference 
in such a way that the centre of mass is at rest in that frame at the 
instant considered. The equation of motion thus derived is valid in 
any other inertial frame, by Galileo's relativity principle. 

We have M = (d/dO^rXp = 2r Xp+.£rXp. Our choice of the 
frame of reference (with V = 0) means that the value of r at the instant 
considered is the same as v = R. Since the vectors v and p = rm are 
parallel, rXp = 0. Replacing p by the force f, we have finally 

dM/dr = K, (27.3) 

where 

K = £rxf. (27.4) 

The vector r X f is called the moment of the force f, and so K is 
the total torque, i.e. the sum of the moments of all the forces acting 
on the body. Like the total force F, the sum (27.4) need include only 
the external forces: by the law of conservation of angular momentum, 
the sum of the moments of the internal forces in a closed system must 
be zero. 

The moment of a force, like the angular momentum, in general 
depends on the choice of the origin about which it is defined. In (27.3) 
and (27.4) the moments are defined with respect to the centre of mass 
of the body. 

When the origin is moved a distance a, the new position vector r' of 
each point in the body is equal to r — a. Hence K = ZrXi = ZYxf+ 
+ ZaXfor 

K = K'+aXF. (27.5) 

Hence we see, in particular, that the value of the torque is independent 
of the choice of origin if the total force F = 0. In this case the body 
is said to be acted on by a couple. 

The change in the potential energy resulting from an infinitesimal 
rotation b§ of the body is 6U = -Zf dR = -Zt-dfyXr = -<5<j>. 
• Z'rXf = — K-d§, whence 

K = -3C//8<|), (27.6) 

which is analogous to formula (27.2) for the force. 
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Let us assume that the vectors F and K are perpendicular. Then a 
vector a can always be found such that K' given by formula (27.5) is 
zero and 

K = axF. (27.7) 

The choice of a is not unique, since the addition to a of any vector 
parallel to F does not affect equation (27.7). The condition K' = 0 
thus gives a straight line, not a point, in the moving system of coordi- 
nates. When K is perpendicular to F, the effect of all the applied forces 
can therefore be reduced to that of a single force F acting along this 
line. 

Such a case is that of a uniform field of force, in which the force on 
a particle is f = eE, with E a constant vector characterising the field 
and e characterising the properties of a particle with respect to the 
field. 1 Then F = E27e, K = ZctXE. Assuming that Ze ^ 0, we define 
a position r 0 such that 

ro = 5>£> (27.8) 

Then the total torque is simply 

K = r 0 XF. (27.9) 

Thus, when a rigid body moves in a uniform field, the effect of the 
field reduces to the action of a single force F applied at the point whose 
position is (27.8). The position of this point is entirely determined 
by the properties of the body itself. In a gravitational field, for example, 
it is the centre of mass. 



§28. Rigid bodies in contact 

The equations of motion (27.1) and (27.3) show that the conditions 
of equilibrium for a rigid body can be written as the vanishing of the 
total force and total torque on the body : 

F = £f=0, K = £rxf=0. (28.1) 

t For example, in a uniform electric field E is the field strength and e the charge; 
in a uniform gravitational field E is the acceleration g due to gravity and e is the 
mass m. 
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Here the summation is over all the external forces acting on the body, 
and r is the position of the "point of application"; the origin with 
respect to which the torque is defined may be chosen arbitrarily, since 
if F = 0 the value of K does not depend on this choice. 

If we have a system of rigid bodies in contact, the conditions (28.1) 
for each body separately must hold in equilibrium. The forces consid- 
ered must include those exerted on each body by those with which 
it is in contact. These forces at the points of contact are called reac- 
tions. It is obvious that the mutual reactions of any two bodies are 
equal in magnitude and opposite in direction. 

In general, both the magnitudes and the directions of the reactions 
are found by solving simultaneously the equations of equilibrium (28. 1) 
for all the bodies. In some cases, however, their directions are given by 
the conditions of the problem. For example, if two bodies can slide freely 
on each other, the reaction between them is normal to the surface. 

If two bodies in contact are in relative motion, dissipative forces of 
friction arise, in addition to the reaction. 

There are two possible types of motion of bodies in contact — sliding 
and rolling. In sliding, the reaction is perpendicular to the surfaces in 
contact, and the friction is tangential. Pure rolling, on the other hand, 
is characterised by the fact that there is no relative motion of the 
bodies at the point of contact; that is, a rolling body is at every instant 
as it were fixed to the point of contact. The reaction may be in any 
direction, i.e. it need not be normal to the surfaces in contact. The 
friction in rolling appears as an additional torque which opposes 
rolling. 

If the friction in sliding is negligibly small, the surfaces concerned 
are said to be perfectly smooth. If, on the other hand, only pure rolling 
without sliding is possible, and the friction in rolling can be neglected, 
the surfaces are said to be perfectly rough. 

In both these cases the frictional forces do not appear explicitly in 
the problem, which is therefore purely one of mechanics. If, on the 
other hand, the properties of the friction play an essential part in 
determining the motion, then the latter is not a purely mechanical 
process (cf. §20). 
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Contact between two bodies reduces the number of their degrees of 
freedom as compared with the case of free motion. Hitherto, in 
discussing such problems, we have taken this reduction into account 
by using coordinates which correspond directly to the actual number 
of degrees of freedom. In rolling, however, such a choice of coordi- 
nates may be impossible. 

The condition imposed on the motion of rolling bodies is that the 
velocities of the points in contact should be equal; for example, when 
a body rolls on a fixed surface, the velocity of the point of contact 
must be zero. In the general case, this condition is expressed by the 
equations of constraint, of the form 



where the c ai are functions of the coordinates only, and the suffix a 
denumerates the equations. If the left-hand sides of these equations 
are not the total time derivatives of some functions of the coordinates, 
the equations cannot be integrated. In other words, they cannot be 
reduced to relations between the coordinates only, which could be 
used to express the position of the bodies in terms of fewer coordinates, 
corresponding to the actual number of degrees of freedom. Such 
constraints are said to be non-holonomic, as opposed to holonomic 
constraints, which impose relations between the coordinates only. 

Let us consider, for example, the rolling of a sphere on a plane. As 
usual, we denote by V the translational velocity (the velocity of the 
centre of the sphere), and by SI the angular velocity of rotation. The 
velocity of the point of contact with the plane is found by putting 
r = -an in the general formula v = V+flXr; a is the radius of the 
sphere and n a unit vector along the normal to the plane. The required 
condition is that there should be no sliding at the point of contact, i.e. 



£ c «<7< = 0, 



(28.2) 



\-aSlXn = 0. 



(28.3) 



This cannot be integrated: although the velocity V is the total time 
derivative of the position vector of the centre of the sphere, the angular 
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velocity is not in general the total time derivative of any coordinate. 
The constraint (28.3) is therefore non-holonomic* 

There is a method of deriving the equations of motion for bodies in 
contact, in which the reactions are introduced explicitly. The essential 
feature of this method, which is sometimes called cTAlemberfs 
principle, is to write for each of the bodies in contact the equations 

dP/df = £ f, dM/d* = £ rxf, (28.4) 

wherein the forces f acting on each body include the reactions. The 
latter are initially unknown and are determined, together with the 
motion of the body, by solving the equations. This method is equally 
applicable for both holonomic and non-holonomic constraints. 



PROBLEMS 

Problem 1. Using d'Alembert's principle, find the equations of motion of a 
homogeneous sphere rolling on a plane under an external force F and torque K. 

Solution. The constraint equation is (28.3). Denoting the reaction force at the 
point of contact between the sphere and the plane by R, we have equations (28.4) 
in the form 

H dV/df = F+R, (1) 
/ dS2/df = K-anXR, (2) 

where we have used the facts that V = fi\ and, for a spherical top, M = ISl. 
Differentiating the constraint equation (28.3) with respect to time, we have V = 
aSlxn. Substituting in equation (1) and eliminating SI by means of (2), we obtain 
(//a,u)(F+R) = Kxn-aR+an(n-R), which relates R, F and K. Writing this 
equation in components and substituting / = j/m 2 (§25, Problem 2(b)), we have 

5 2 5 2 

= ~ ~Y = ~~7a~ Kx~-y- F y , R, = —F„ 

where the plane is taken as the xy plane. Finally, substituting these expressions in 
(1), we obtain the equations of motion involving only the given external force and 



t It may be noted that the similar constraint in the rolling of a cylinder is 
holonomic. In that case the axis of rotation has a fixed direction in space, and hence 
Q = d<£/d/ is the total derivative of the angle (j> of rotation of the cylinder about its 
axis. The condition (28.3) can therefore be integrated, and gives a relation between 
the angle <f> and the coordinate of the centre of mass. 
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torque: 




The components Q x , Q v of the angular velocity are given in terms of V x , V, by the 
constraint equation (28.3); for Q z we have the equation f/*a 2 dQJdt = K„ the 
z component of equation (2). 

Problem 2. A rod of weight P has one end A on a vertical plane and the other 
end B on a horizontal plane (Fig. 25), and is held in position by two horizontal 
strings AD and BC, the latter being in the same vertical plane as AB. Determine 
the reactions of the planes and the tensions in the strings. 




Fig. 25 



Solution. The tensions T A and T B are from A to D and from B to C respect- 
ively. The reactions R A and R B are perpendicular to the corresponding planes. The 
solution of the equations of equilibrium gives R B — P,T B = jP cot a, R A = T B 
sin 0, T A = T B cos 0. 

Problem 3. Two rods of length / and negligible weight are hinged together, and 
their ends are connected by a string AB (Fig. 26). They stand on a plane, and a 
force Fis applied at the midpoint of one rod. Determine the reactions. 

Solution. The tension T acts at A from A to B, and at B from B to A. The 
reactions R A and R B at A and B are perpendicular to the plane. Let R 0 be the 
reaction on the rod AC at the hinge; then a reaction — R© acts on the rod BC. 
The condition that the sum of the moments of the forces R a , Tand -R 0 acting on 
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Fio. 26 

the rod BC should be zero shows that R<, acts along BC. The remaining conditions 
of equilibrium (for the two rods separately) give R A = \F, R B = \F, R c = 
cosec a,T= fFcot a, where a is the angle CAB. 

§29. Motion in a non-inertial frame of reference 

Up to this point we have always used inertial frames of reference in 
discussing the motion of mechanical systems. For example, the Lagran- 
gian 

Lo = fmrg-£/, (29.1) 

and the corresponding equation of motion m dvo/df = —dU/dr, for 
a single particle in an external field are valid only in an inertial frame. 
(In this section the suffix 0 denotes quantities pertaining to an inertial 
frame.) 

Let us now consider what the equations of motion will be in a non- 
inertial frame of reference. The basis of the solution of this problem 
is again the principle of least action, whose validity does not depend 
on the frame of reference chosen. Lagrange's equations 
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are likewise valid, but the Lagrangian is no longer of the form (29.1), 
and to derive it we must carry out the necessary transformation of the 
function Lo- 

This transformation is done in two steps. Let us first consider a 
frame of reference K' which moves with a translational velocity \(t) 
relative to the inertial frame K 0 . The velocities v 0 and v' of a particle 
in the frames K 0 and K' respectively are related by 

vo - v'+V(r). (29.3) 

Substitution of this in (29.1) gives the Lagrangian in K' \ 

U = iffjv' 2 +ffJv'.V+!-ffjV 2 -E/. 

Now V 2 (f) is a given function of time, and can be written as the total 
derivative with respect to t of some other function ; the third term in 
L can therefore be omitted. Next, v' = dr'/df, where r' is the position 
of the particle in the frame K'. Hence 

m\(t)-\' = mV-dr'/dr = d(m\ ■r'ydt-mr' -dV/dt. 

Substituting in the Lagrangian and again omitting the total time deri- 
vative, we have finally 

L' = \mv'* -mW(t) r' - U, (29.4) 

where W = dY/dt is the translational acceleration of the frame K'. 
The Lagrange's equation derived from (29.4) is 

m-^- = -|^-«W(0. (29.5) 

Thus an accelerated translational motion of a frame of reference is 
equivalent, as regards its effect on the equations of motion of a par- 
ticle, to the application of a uniform field of force equal to the mass 
of the particle multiplied by the acceleration W, in the direction 
opposite to this acceleration. 

Let us now bring in a further frame of reference K, whose origin 
coincides with that of K', but which rotates relative to K' with angular 
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velocity Sl(t). Thus K executes both a translational and a rotational 
motion relative to the inertial frame K 0 . 

The velocity v' of the particle relative to K' is composed of its velocity 
v relative to K and the velocity SlXr of its rotation with K: v' = v+ 
+SI Xr (since the positions r and r' in the frames K and K' coincide). 
Substituting this in the Lagrangian (29.4), we obtain 

L = ^mv 2 +mv-£lXr+Tm(£lXr) 2 -mW-r-U. (29.6) 

This is the general form of the Lagrangian of a particle in an arbitrary, 
not necessarily inertial, frame of reference. The rotation of the frame 
leads to the appearance in the Lagrangian of a term linear in the veloc- 
ity of the particle. 

To calculate the derivatives appearing in Lagrange's equation, we 
write the total differential 

dL = mv-&v+m&v-£lxr+rm-£lxdr+ 

+ m(Slxr)-(Sl x dr) -mW -dr -(9 U/dr) -dr 
— m\'dv+md\'£lXr+mdr'\x£l+ 

+ m(Sl X r) X SI «dr -mW -dr -(9 U/Sr) -dr. 

The terms in dv and dr give 

9Z,/9v = rm+mSlXt, 

dL/dr = m\x£l+m(£lXr)x£l-mW -dU/dr. 

Substitution of these expressions in (29.2) gives the required equation 
of motion : 

m dv/dt = -dU/dr -mff + rm X SI + 2mvX SI + mSl X (rXSl). 

(29.7) 

We see that the "inertia forces" due to the rotation of the frame 
consist of three terms. The force mrXSi is due to the non-uniformity 
of the rotation, but the other two terms appear even if the rotation is 
uniform. The force 2mvXSl is called the Coriolis force; unlike any 
other (non-dissipative) force hitherto considered, it depends on the 
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velocity of the particle. The force m£lx(r X £2) is called the centrifugal 
force. It lies in the plane through r and SI, is perpendicular to the axis 
of rotation (i.e. to £2), and is directed away from the axis. The magni- 
tude of this force is m(£l 2 , where q is the distance of the particle from 
the axis of rotation. 

Let us now consider the particular case of a uniformly rotating 
frame with no translational acceleration. Putting in (29.6) and (29.7) 
£2 = constant, W = 0, we obtain the Lagrangian 

L = \mv*+mv-£lXr+^m(£lXr)*-U (29.8) 
and the equation of motion 

md\/dt= -dU/dr+2rmXSl+mSlx(rXSl). (29.9) 
The energy of the particle in this case is obtained by substituting 

p = 3L/3v = mv+mSiXr (29.10) 
in E = p-v— L, which gives 

E = ±mv* -\m(Sl X rf + U. (29. 1 1) 

It should be noticed that the energy contains no term linear in the 
velocity. The rotation of the frame simply adds to the energy a term 
depending only on the coordinates of the particle and proportional to 
the square of the angular velocity. This additional term — ^mfSlXr) 2 
is called the centrifugal potential energy. 

The velocity v of the particle relative to the uniformly rotating frame 
of reference is related to its velocity v 0 relative to the inertial frame Ko 
by 

v 0 = v+£2Xr. (29.12) 

The momentum p (29.10) of the particle in the frame K is therefore 
the same as its momentum po = rmo in the frame K 0 . The angular 
momenta Mo = rXpo and M = rXp are likewise equal. The energies 
of the particle in the two frames are not the same, however. Substitut- 
ing v from (29.12) in (29.11), we obtain E = \tm%- wv 0 -£2Xr+ U = 
jmvl+U— mrXvo'&l. The first two terms are the energy E 0 in the 
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frame K 0 . Using the angular momentum M, we have 

E = Eo-M-Sl. (29.13) 

This formula gives the law of transformation of energy when we change 
to a uniformly rotating frame. Although it has been derived for a 
single particle, the derivation can evidently be generalised immediately 
to any system of particles, and the same formula (29.13) is obtained. 

PROBLEMS 

Problem 1. Find the deflection of a freely falling body from the vertical caused 
by the Earth's rotation, assuming the angular velocity of this rotation to be small. 

Solution. In a gravitational field U = — /ng-r, where g is the gravity accelera- 
tion vector; neglecting the centrifugal force in equation (29.9) as containing the 
square of £2, we have the equation of motion 

f = 2?X£2+g. (1) 

This equation may be solved by successive approximations. To do so, we put 
v = Ti+v 2 , where Vi is the solution of the equation — g, i.e. V! = gf+T 0 (▼<> 
being the initial velocity). Substituting t = Ti+?2 in (1) and retaining only ?! 
on the right, wehaveforT 2 the equation?, = 2v!X£2 = 2fgXS2+2T 0 XS2. Integra- 
tion gives 

r = h+v o t+±gt*+it*gXSl+th 0 XSl, (2) 

where h is the initial position of the particle. 

Let the z axis be vertically upwards, and the x axis towards the pole; then 
g x = g v = 0, g, = -g; = £1 cos A, Q„ = 0, Q, = Q sin A, where A is the latitude 
(which for definiteness we take to be north). Putting t 0 = 0 in (2), we find x = 0, 
y = -\t*g£l cos A. Substitution of the time of fall t « V(Zh/g) gives finally x = 0, 
y — — Wh/gfPgQ. cos A, the negative value indicating an eastward deflection. 

Problem 2. Determine the deflection from coplanarity of the path of a particle 
thrown from the Earth's surface with velocity t 0 . 

Solution. Let the xz plane be such as to contain the velocity t 0 . The initial 
altitude h = 0. The lateral deviation is given by (2), Problem 1 : y = 
-jtBgCl^+t^ClxVot-QtVo,) or, substituting the time of flight t^2v l)l lg,y = 
4tfo 2 f2(|tf(fe cos k-v ax sin A)/# 2 . 

Problem 3. Determine the effect of the Earth's rotation on small oscillations of a 
pendulum (the problem of Foucault's pendulum). 

Solution. Neglecting the vertical displacement of the pendulum, as being a 
quantity of the second order of smallness, we can regard the motion as taking place 
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in the horizontal xy plane. Omitting terms in Q?, we have the equations of motion 
x+co 2 x = ZCl^y, y+m 2 y — —2£l z x, where to is the frequency of oscillation of the 
pendulum if the Earth's rotation is neglected. Multiplying the second equation by 
i and adding, we obtain a single equation S+2iQJ+co 2 i = 0 for the complex 
quantity £ = x+iy. For C2, ■« to, the solution of this equation is 

$ = exp ( — iCV) [A l exp (uot) + A 2 exp ( — tot)] 

or 

x+iy = (x 0 +iy 0 ) exp(-i'CV). 

where the functions x 0 (t), y 0 (t) give the path of the pendulum when the Earth's 
rotation is neglected. The effect of this rotation is therefore to turn the path about 
the vertical with angular velocity Q,. 



CHAPTER 7 



THE CANONICAL EQUATIONSt 



§30. Hamilton's equations 



The formulation of the laws of mechanics in terms of the Lagran- 
gian, and of Lagrange's equations derived from it, presupposes that the 
mechanical state of a system is described by specifying its generalised 
coordinates and velocities. This is not the only possible mode of 
description, however. A number of advantages, especially in the study 
of certain general problems of mechanics, attach to a description in 
terms of the generalised coordinates and momenta of the system. The 
question therefore arises of the form of the equations of motion 
corresponding to that formulation of mechanics. 

The passage from one set of independent variables to another can be 
effected by means of what is called in mathematics Legendre's trans- 
formation. In the present case this transformation is as follows. The 
total differential of the Lagrangian as a function of coordinates and 



t The reader may find useful the following table showing certain differences 
between the nomenclature used in this book and that which is generally used in the 
English literature. 
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velocities is 




This expression may be written 

dL = Y l Pidq i +Y i Pidq l , (30.1) 

since the derivatives dL/Qq, are, by definition, the generalised momenta, 
and dL/dqj = p\ by Lagrange's equations. Writing the second term in 
(30.1) as Zpi Aq i = d(Ep i q^~Eq i dp,, taking the differential &(Zp t q^ 
to the left-hand side, and reversing the signs, we obtain from (30.1) 

d(ZPi<ii- L ) = -£/M?i+£?/dp/. 

The argument of the differential is the energy of the system (cf. §6) ; 
expressed in terms of coordinates and momenta, it is called the 
Hamilton's function or Hamiltonian of the system : 

H{p,q,t) = Y jPi q i -L. (30.2) 
i 

From the equation in differentials 

dH= -Y,Pid<li+Y,<i' d P<> < 30 - 3 ) 
we have the equations 

q t = BH/dp,, p, = -dH/dq,. (30.4) 

These are the required equations of motion in the variables p and q, 
and are called Hamilton's equations. They form a set of 2s first-order 
differential equations for the 2s unknown functions p^t) and q^t), 
replacing the s second-order equations in the Lagrangian treatment. 
Because of their simplicity and symmetry of form, they are also called 
canonical equations. 

The total time derivative of the Hamiltonian is 

AH SH „ 87/ . „ clH . 

w = -dT + ^dq- i qi+ ^d P ; Pi - 
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Substitution of q, and p, from equations (30.4) shows that the last 
two terms cancel, and so 

dH/dt = dH/dt. (30.5) 

In particular, if the Hamiltonian does not depend explicitly on time, 
then dH/dt = 0, and we have the law of conservation of energy. 

As well as the dynamical variables q, q or q, p, the Lagrangian and 
the Hamiltonian involve various parameters which relate to the prop- 
erties of the mechanical system itself, or to the external forces on it. 
Let A be one such parameter. Regarding it as a variable, we have 
instead of (30.1) 

dL = £p, d?,+2> d<7,+(9L/9A) dA, 

and (30.3) becomes 

dH = - £p, d?/+X?/ dp,-(3L/3A) dA. 

Hence 

(3tf/3A) p><? - -(3L/3A),,,, (30.6) 

which relates the derivatives of the Lagrangian and the Hamiltonian 
with respect to the parameter A. The suffixes to the derivatives show 
the quantities which are to be kept constant in the differentiation. 

This result can be put in another way. Let the Lagrangian be of the 
form L = L 0 +L', where L' is a small correction to the function L 0 . 
Then the corresponding addition H' in the Hamiltonian H = H 0 +H' 
is related to L' by 

(H') Piq = -(L% q . (30.7) 
PROBLEMS 

Problem 1. Find the Hamiltonian for a single particle in Cartesian, cylindrical 
and spherical coordinates. 

Solution. In Cartesian coordinates x, y, z, 

H = ~{P\+Pl+Pl)+ U(x, y, z); 

in cylindrical coordinates r, </>, z, 
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in spherical coordinates r, 0, <fi, 



r*sin*0 



+ U(r. 8,<t>). 



Problem 2. Find the Hamiltonian for a particle in a uniformly rotating frame of 
reference. 

Solution. Expressing the velocity v in the energy (29.11) in terms of the momen- 
tum p by (29.10), we have H = p 2 /2m-£J-rXp+ U. 



In formulating the principle of least action, we have considered the 
integral 



taken along a path between two given positions q (1) and q (2) which the 
system occupies at given instants h and t 2 . In varying the action, we 
compared the values of this integral for neighbouring paths with the 
same values of q(h) and q(t 2 ). Only one of these paths corresponds 
to the actual motion, namely the path for which the integral S has 
its minimum value. 

Let us now consider another aspect of the concept of action, regard- 
ing S as a quantity characterising the motion along the actual path, 
and compare the values of S for paths having a common beginning 
at q{h) = q a) , but passing through different points at time t 2 . In other 
words, we consider the action integral for the true path as a function 
of the coordinates at the upper limit of integration. 

The change in the action from one path to a neighbouring path is 
given (if there is one degree of freedom) by the expression (2.5): 



§31. The Hamilton-Jacobi equation 



S = J L dt, 



(31.1) 



dS = 



9L 
Sq 




Since the paths of actual motion satisfy Lagrange's equations, the 
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integral in bS is zero. In the first term we put dq(h) = 0, and denote 
the value of dq(t 2 )by bq simply. Replacing dL/dq by p, we have finally 
6S = pdq or, in the general case of any number of degrees of freedom, 

dS = Y i p,6q l . (31.2) 

i 

From this relation it follows that the partial derivatives of the action 
with respect to the coordinates are equal to the corresponding mo- 
menta : 

dS/d qi = Pi . (31.3) 

The action may similarly be regarded as an explicit function of time, 
by considering paths starting at a given instant t\ and at a given point 
q a \ and ending at a given point q (2) at various times t 2 = t. The partial 
derivative dSjdt thus obtained may be found by an appropriate 
variation of the integral. It is simpler, however, to use formula (31.3), 
proceeding as follows. 

From the definition of the action, its total time derivative along 
the path is 

dS/dt = L. (31.4) 

Next, regarding S as a function of coordinates and time, in the sense 
described above, and using formula (31.3), we have 

dS dS „ dS . BS „ . 

A comparison gives dS/dt = L—Ep i q i or 

dS/dt = -H(p, q, t). (31.5) 
Formulae (31.3) and (31.5) may be represented by the expression 
dS = Y l P,dq i -Hdt (31.6) 

for the total differential of the action as a function of coordinates and 
time at the upper limit of integration in (31.1). 
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The action itself correspondingly becomes 

S = j(Z Pi d qi -Hdt). (31.7) 

In particular, if the function H(p, q) does not depend explicitly on the 
time, so that the energy is conserved, we can replace H(p, q) by a 
constant E, and the time dependence of 5 reduces to a term — Et : 

S(q, t) = S 0 (q)-Et, (31.8) 

where 

So(<7) = Jl>d<7 ( (31.9) 

i 

is sometimes called the abbreviated action. 

Replacing the momenta p in (31.5) by the derivatives dS/dq, we 
have the differential equation 

IMf *-i » < 3i -'°> 

which must be satisfied by the function S(q, t). This first-order partial 
differential equation is called the Hamilton- J acobi equation. For a 
single particle in an external field U(x, y, z, t), for instance, it is 
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(f)Hf-r + (sr: 



+ U(x, y, z, 0 = 0. (31.11) 



The Hamilton-Jacobi equation takes a somewhat simpler form if 
the function H(p, q) does not involve the time explicitly. Taking S(<7, t) 
from (31.8) gives for the abbreviated action S 0 (q) the equation 

"(ff---lt^ *)-* < 3I - I2 > 



§32. Adiabatic invariants 

Let us consider a mechanical system executing a finite motion in 
one dimension and characterised by some parameter 1 which specifies 
the properties of the system or of the external field in which it is 



§32 



Adiabatic invariants 



109 



placed, and let us suppose that A varies slowly (adiabatically) with 
time as the result of some external action; by a "slow" variation we 
mean one in which A varies only slightly during the period T of the 
motion : 

rdA/df«A. (32.1) 

Such a system is not closed, and its energy E is not conserved. How- 
ever, since A varies only slowly, the rate of change E of the energy 
is proportional to the rate of change A of the parameter. This means 
that the energy of the system behaves as some function of A when the 
latter varies. In other words, there is some combination of E and A 
which remains constant during the motion. This quantity is called an 
adiabatic invariant. 

Let H(p, q ; A) be the Hamiltonian of the system, which depends on 
the parameter A. According to formula (30.5), the total time derivative 
of the energy of the system is dEjdt = dHjdt = (dHjdX)(dXjdt). In 
averaging this equation over the period of the motion, we need not 
average the second factor, since A (and therefore A) varies only slowly : 
dEjdt = (dA/df)(3/7/3A), and in the averaged function 3///3A we can 
regard only p and q, and not A, as variable. That is, the averaging is 
taken over the motion which would occur if A remained constant. 

The averaging may be explicitly written 

T 

dE_d^ 1 f 8// 

dt ~ dt ' T J 3A 

o 

According to Hamilton's equation q = 3/7/3/?, or dt = dq/(dH/dp). 
The integration with respect to time can therefore be replaced by one 
with respect to the coordinate, with the period T written as 

T 

T= jdt = fdq/(dH/dp); 
o 

here the <j> sign denotes an integration over the complete range of 
variation ("there and back") of the coordinate during the period. 
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Thus 

dE dA j(dH/dX)dql(dHldp) 
dt ~ dt j dql(dH/dp) 



(32.2) 



As has already been mentioned, the integrations in this formula must 
be taken over the path for a given constant value of A. Along such a 
path the Hamiltonian has a constant value E, and the momentum is 
a definite function of the variable coordinate q and of the two inde- 
pendent constant parameters E and A. Putting therefore p = p{q ; E, A) 
and differentiating with respect to A the equation H(p, q; A) = E, we 
have dH/dX+(dHldp)(clpldX) = 0, or 

dH/dX _ dp 
dH/dp~ ~ 9A " 

Substituting this in the numerator of (32.2) and writing the integrand 
in the denominator as dpjdE, we obtain 



dE dA j(dp/dX)dq 
dt dt j(dp/dE)dq 



(32.3) 



or 



C(dpdE dp dX\^ 
Finally, this may be written as 



d//d/ = 0, (32.4) 

where 

1= jp dqjln, (32.5) 

the integral being taken over the path for given E and A. This shows 
that, in the approximation here considered, J remains constant when 
the parameter A varies, i.e. / is an adiabatic invariant. 

The integral (32.5) has a geometrical significance in terms of the 
phase path of the system, i.e. the curve which represents the depend- 
ence of p on q; the phase path of a system executing a periodic motion 



§32 



Adiabatic invariants 



111 



is a closed curve. The integral (32.5) taken round this curve is the 
area enclosed. 

As an example, let us determine the adiabatic invariant for a one- 
dimensional oscillator. The Hamiltonian is H = \p 2 \m+\rmi 2 q 2 , 
where to is the frequency of the oscillator. The equation of the phase 
path is given by the law of conservation of energy H(p, q) = E. The 
path is an ellipse with semiaxes -y/ (2mE) and y/ (2E/nwo 2 ), and its area, 
divided by In, is 

/ = E/co. (32.6) 

The adiabatic invariance of / signifies that, when the parameters of the 
oscillator vary slowly, the energy is proportional to the frequency. 
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§33. Velocity of propagation of interactions 

The interaction of material particles is described in ordinary mech- 
anics by means of a potential energy of interaction, which is a function 
of the coordinates of the interacting particles. It is easy to see that this 
manner of describing interactions contains the assumption of instanta- 
neous propagation of interactions. For the forces exerted on each 
of the particles by the other particles at a particular instant of time 
depend, according to this description, only on the positions of the 
particles at this one instant. A change in the position of any of the 
interacting particles influences the other particles immediately. 

However, experiment shows that instantaneous interactions do not 
exist in nature. Thus a mechanics based on the assumption of instanta- 
neous propagation of interactions contains within itself a certain 
inaccuracy. In actuality, if any change takes place in one of the 
interacting bodies, it will influence the other bodies only after the 
lapse of a certain interval of time. Dividing the distance between the 
two bodies by this time interval, we obtain the velocity of propagation 
of the interaction. 

This velocity should, strictly speaking, be called the maximum veloc- 
ity of propagation of interactions. It determines only that interval 
of time after which a change occurring in one body begins to manifest 
itself in another by the arrival of a signal. It is clear that the existence 
of a maximum velocity of propagation of interactions implies, at the 
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same time, that motions of bodies with greater velocity than this are 
impossible in nature. 

From the principle of relativity it follows that the velocity of pro- 
pagation of interactions is the same in all inertial frames of reference. 
Thus the velocity of propagation of interactions is a universal con- 
stant. 

This constant velocity (as we shall show later) is also the velocity of 
light in empty space. The velocity of light is usually designated by the 
letter c, and its numerical value is 

c = 2-998X10 10 cm/sec. (33.1) 

The large value of this velocity explains the fact that in practice 
classical mechanics is found to be sufficiently accurate in most cases. 
The velocities with which we have occasion to deal are usually so small 
compared with the velocity of light that the assumption that the latter 
is infinite does not materially affect the accuracy of the results. 

The combination of the principle of relativity with the finiteness of 
the velocity of propagation of interactions is called Einstein's relativity 
principle (it was formulated by Einstein in 1905) in contrast to Galileo's 
relativity principle, which was based on an infinite velocity of pro- 
pagation of interactions. 

The mechanics based on Einstein's relativity principle (we shall 
usually refer to it simply as the principle of relativity) is called relativ- 
istic. In the limiting case when the velocities of the moving bodies are 
small compared with the velocity of light we can neglect the effect on 
their motion of the finiteness of the velocity of propagation. Then 
relativistic mechanics goes over into classical mechanics, based on the 
assumption of instantaneous propagation of interactions. The limiting 
transition from relativistic to classical mechanics can be produced 
formally by the transition to the limit c — °° in the formulae of relativ- 
istic mechanics. 

In classical mechanics distance is already relative, i.e. the spatial 
relations between different events depend on the frame of reference 
in which they are described. The statement that two non-simultaneous 
events occur at one and the same point in space or, in general, at a 
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definite distance from each other, acquires a meaning only when we 
indicate the frame of reference which is used. 

On the other hand, time is absolute in classical mechanics ; in other 
words, the properties of time are assumed to be independent of the 
frame of reference; there is one time for all frames. This means that 
if any two phenomena occur simultaneously for any one observer, 
then they occur simultaneously also for all others. In general, the 
interval of time between two given events must be identical for all 
frames of reference. 

It is easy to show, however, that the idea of an absolute time is in 
complete contradiction to Einstein's relativity principle. For this it is 
sufficient to recall that in classical mechanics, based on the concept 
of an absolute time, the familiar law of composition of velocities is 
valid, according to which the velocity of a composite motion is simply 
equal to the (vector) sum of the velocities of the constituent motions. 
This law, being universal, should also be applicable to the propagation 
of interactions. From this it would follow that the velocity of propa- 
gation must be different in different inertial frames of reference, in con- 
tradiction to the principle of relativity. In this matter experiment com- 
pletely confirms the principle of relativity. Measurements first per- 
formed byMichelson (1881) showed complete lack of dependence of 
the velocity of light on its direction of propagation, whereas according 
to classical mechanics the velocity of light should not be the same in 
the direction of the Earth's motion as it is in the opposite direction. 

Thus the principle of relativity leads to the result that time is not 
absolute. Time elapses differently in different frames of reference. 
Consequently the statement that a definite time interval has elapsed 
between two given events acquires meaning only when the reference 
frame to which this statement applies is indicated. In particular, events 
which are simultaneous in one reference frame will not be simultaneous 
in other frames. 

To clarify this, it is instructive to consider the following simple 
example. Let us look at two inertial reference frames K and K' with 
coordinate axes XYZ and X' Y' Z' respectively, where the frame K' 
moves relative to K along the axis X(X') (Fig. 27). 



§34 



Intervals 



115 



z 



Z' 



B— A— C 



X' 




X 



Y 



Y' 



Fig. 27 



Suppose signals start out from some point A on the axis X' in two 
opposite directions. Since the velocity of propagation of a signal in 
the frame K', as in all inertial frames, is equal (for both directions) to 
c, the signals will reach points B and C, equidistant from A, at one 
and the same time (in the frame K'). But it is easy to see that the same 
two events (arrival of the signal at B and C) can by no means be 
simultaneous for an observer in the frame K: the velocity of signals 
relative to the frame A^has, according to the principle of relativity, the 
same value c, and since the point B moves (relative to the frame K) 
toward the source of its signal, while the point C moves in the direc- 
tion away from the signal (sent from A to C), in the frame K the signal 
will reach the point B earlier than the point C. 

Thus Einstein's relativity principle introduces fundamental changes 
in basic physical concepts. The notions of space and time derived by 
us from our daily experiences are only approximations due to the fact 
that in daily life we happen to deal only with velocities which are very 
small compared with the velocity of light. 



In what follows we shall frequently use the concept of an event. 
An event is described by the place where it occurred and the time 
when it occurred. Thus an event occurring in a certain material par- 
ticle is defined by the three coordinates of that particle and the time 
when the event occurs. 
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It is frequently useful for reasons of presentation to use a fictitious 
four-dimensional space, on the axes of which are marked three space 
coordinates and the time. In this space events are represented by points, 
called world points. In this four-dimensional space there corresponds 
to each particle a certain line, called a world line. The points of this 
line determine the coordinates of the particle at all moments of time. 
It is easy to show that to a particle in uniform rectilinear motion there 
corresponds a straight world line. 

We now express the principle of the invariance of the velocity of 
light in mathematical form. For this purpose we consider two frames 
of reference K and K' moving relative to each other with constant 
velocity. We choose the coordinate axes so that the axes X and X' 
coincide, while the axes Y and Z are parallel to V and Z'; we designate 
the time in the frames K and K' by t and t' . 

Let the first event consist of sending out a signal, propagating with 
light velocity, from a point having coordinates *i yi zi in the frame K, at 
time ti in this frame. We observe the propagation of this signal in the 
frame K. Let the second event consist of the arrival of the signal at 
point at the moment of time t%. The signal propagates with 

velocity c; the distance covered by it is therefore c{tz— ti). On the other 

l_ 

hand, this same distance equals [(x 2 — xi) 2 +(y 2 — yi) 2 +(z 2 — zi) 2 ] 2 . 
Thus we can write the following relation between the coordinates of the 
two events in the frame K: 

(jc a -*i) 2 + (y» -yif + (z2 -zif -c\h-hf = 0. (34. 1) 
The same two events, i.e. the propagation of the signal, can be 
observed from the frame K'. Let the coordinates of the first event in 
the frame K' be x'^z'^, and of the second: x'^y'^' 2 t' 2 . Since the 
velocity of light is the same in the frames K and K', we have, similarly 
to (34.1), 

(*; -xtf+W -ytf+te -*D 2 - -ti) 2 = 0. (34.2) 
If xiyizih and X2j 2 Z2?2 are the coordinates of any two events, then 
the quantity 

S12 = [c\h-h)*-(x* -xif -b>2 -yif -(zi -ziff (34.3) 
is called the interval between these two events. 
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Thus it follows from the principle of invariance of the velocity of 
light that, if the interval between two events is zero in one frame of 
reference, then it is equal to zero in all other frames. 

If two events are infinitely close to each other, then for the interval 
ds between them 

ds 2 = c 2 df 2 -dx 2 -dj 2 -dz 2 . (34.4) 

The form of (34.3) and (34.4) permits us to regard the interval, from 
the formal mathematical point of view, as the distance between two 
points in a fictitious four-dimensional space (whose coordinates are 
x, y, z, and the product ct). But there is a basic difference between 
the rule for forming this quantity and the rule in ordinary geometry: 
in forming the square of the interval, the squares of the coordinate 
differences along the different axes are taken, not with the same sign, 
but rather with varying signs. 1. 

As already shown, if ds = 0 in one inertial frame, then ds' = 0 in 
any other frame. On the other hand, ds and ds' are infinitesimals of the 
same order. From these two conditions it follows that ds 2 and ds' 2 
must be proportional to each other : 

ds 2 = a ds' 2 , 

where the coefficient a can depend only on the absolute value of the 
relative velocity of the two inertial frames. It cannot depend on the 
coordinates or the time, since then different points in space and differ- 
ent moments in time would not be equivalent, which would be in 
contradiction to the homogeneity of space and time. Similarly, it 
cannot depend on the direction of the relative velocity, since that 
would contradict the isotropy of space. 

Let us consider three frames of reference K, Ki, K 2 , and let Vi and V2 
be the velocities of Ki and K 2 relative to K. We then have 

ds 2 = a(F,) ds 2 , ds 2 = a(V 2 ) dsf. 

t The four-dimensional geometry described by the quadratic form (34.4) was 
introduced by Minkowski, in connection with the theory of relativity. This geometry 
is called pseudo-Euclidean, in contrast to ordinary Euclidean geometry. 
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Similarly we can write 

dsf = a(V 12 ) dsf , 

where Vi 2 is the absolute value of the velocity of K 2 relative to Ki. 
Comparing these relations with one another, we find that we must 
have 

a(V 2 ) 



= a(V 12 ). (34.5) 



But Vu depends not only on the absolute values of the vectors Vi 
and V 2 , but also on the angle between them. However, this angle does 
not appear on the left side of formula (34.5). It is therefore clear that 
this formula can be correct only if the function a(V) reduces to a 
constant, which is equal to unity according to this same formula. 
Thus, 

ds* = ds' z , 

and from the equality of the infinitesimal intervals there follows the 
equality of finite intervals : s = s'. 

Thus we arrive at a very important result : the interval between two 
events is the same in all inertial frames of reference, i.e. it is invariant 
under transformation from one inertial frame to any other. This 
invariance is the mathematical expression of the constancy of the 
velocity of light. 

Again let xiyizih and x^y^z^U be the coordinates of two events 
in a certain frame of reference K. Does there exist a frame K' in which 
these two events occur at one and the same point in space? 

We introduce the notation 

h - h = h 2 , {x 2 -xj* + (y 2 -y^ + {z 2 -Zj) 2 = l\ 2 . 
Then the interval between the events in the frame K is 

^12 ~ £^12 ^12 

and in the frame K' 

A 12 — <" *12 '12 > 

whereupon, because of the invariance of intervals, 

<*l 2 2-/l 2 2 = ^i-/i2 2 . 
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We want the two events to occur at the same point in the frame K', 
that is, we require l' 12 — 0. Then 



s- 



2 = cH\\ >0. 



12 — «- '12 — '12 — «- '12 

Consequently a frame of reference with the required property exists 
if ^2 > 0, that is, if the interval between the two events is a real number. 
Real intervals are said to be time-like. 

Thus, if the interval between two events is time-like, then there 
exists a frame of reference in which the two events occur at one and the 
same place. The time which elapses between the two events in this 
frame is 

tn = ^V(<?th-%) = - c 2 ■ (34.6) 

If two events occur in one and the same body, then the interval 
between them is always time-like, for the distance which the body 
moves between the two events cannot be greater than c/i 2 , since the 
velocity of the body cannot exceed c. So we have always 

'l2 < cfi 2 . 

Let us now ask whether or not we can find a frame of reference in 
which two events occur at one and the same time. As before, we have 
for the frames K and K' cH\ 2 -l\ 2 = cH'? 2 -l'* 2 . We want to have 
t' 12 = 0, so that 

*12 



*?» = -/,! < o. 



Consequently the required frame can be found only for the case 
when the interval si 2 between the two events is an imaginary number. 
Imaginary intervals are said to be space-like. 

Thus if the interval between two events is space-like, there exists 
a frame of reference in which the two events occur simultaneously. 
The distance between the points where the events occur in this frame is 

l'i2 = V(/i 2 2-c 2 fi 2 2 ) = |*,|. (34.7) 



The division of intervals into space- and time-like intervals is, 
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because of their invariance, an absolute concept. This means that the 
time-like or space-like character of an interval is independent of the 
frame of reference. 

Let us take some event O as our origin of time and space coordinates. 
In other words, in the four-dimensional system of coordinates, the 
axes of which are marked x, y, z, t, the world point of the event O is 
the origin of coordinates. Let us now consider what relation other 




Fig. 28 



events bear to the given event O. For visualisation, we shall consider 
only one space dimension and the time, marking them on two axes 
(Fig. 28). Uniform rectilinear motion of a particle, passing through 
x = 0 at t = 0, is represented by a straight line going through O and 
inclined to the t axis at an angle whose tangent is the velocity of the 
particle. Since the maximum possible velocity is c, there is a maximum 
angle which this line can make with the t axis. In Fig. 28 are shown 
the two lines representing the propagation of two signals (with the 
velocity of light) in opposite directions passing through the event O 
(i.e. going through x = 0 at t = 0). All lines representing the motion 
of particles can lie only in the regions aOc and dOb. On the lines ab 
and cd, x = + ct. First consider events whose world points lie within 
the region aOc. It is easy to show that for all the points of this region 
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c 2 t 2 —x 2 > 0. In other words, the interval between any event in this 
region and the event O is time-like. In this region t > 0, i.e. all the 
events in this region occur "after" the event O. But two events which are 
separated by a time-like interval cannot occur simultaneously in any 
frame of reference. Consequently it is impossible to find a frame of 
reference in which any of the events in the region aOc occurred 
"before" the event O, i.e. at time t < 0. Thus all the events in the 
region aOc are future events relative to O in all frames of reference. 
Therefore this region can be called the absolute future relative to O. 

In exactly the same way, all events in the region bOd are in the 
absolute past relative to O; i.e. events in this region occur before the 
event O in all frames of reference. 

Finally, consider the regions dOa and cOb. The interval between 
any event in this region and the event O is space-like. These events 
occur at different points in space in every reference frame. Therefore 
these regions can be said to be absolutely remote relative to O. How- 
ever, the concepts "simultaneous", "earlier", and "later" are relative 
for these events. For any event in these regions there exist frames of 
reference in which it occurs after the event O, frames in which it occurs 
earlier than O, and finally one reference frame in which it occurs 
simultaneously with O. 

Note that if we consider all three space coordinates instead of just 
one, then instead of the two intersecting lines of Fig. 28 we would 
have a "cone" x 2 +y 2 +z 2 ~ c 2 t 2 = 0 in the four-dimensional coordi- 
nate system x, y, z, t, the axis of the cone coinciding with the t axis. 
(This cone is called the light cone.) The regions of "absolute future" 
and "absolute past" are then represented by the two interior portions 
of this cone. 

Two events can be related causally to each other only if the interval 
between them is time-like; this follows immediately from the fact that 
no interaction can propagate with a velocity greater than the velocity 
of light. As we have just seen, it is precisely for these events that the 
concepts "earlier" and "later" have an absolute significance, which is 
a necessary condition for the concepts of cause and effect to have 
meaning. 
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§35. Proper time 

Suppose that in a certain inertial reference frame K we observe a 
clock which is moving relative to us in an arbitrary manner. We also 
introduce an inertial frame K' moving relative to K with the velocity 
v of the clock at that time. 

In the course of an infinitesimal time interval dt (as read by a clock 
in our rest frame) the moving clock goes a distance y/ (dx 2 + dy 2 + dz 2 ). 
Let us ask what time interval dt' is indicated for this period by the 
moving clock. In the frame K' linked to the moving clock, the latter 
is at rest, i.e., dx' = dy' = dz' = 0. Because of the invariance of 
intervals 

d.s 2 = c 2 dt 2 -dx 2 -dy 2 -dz 2 = c 2 dt' 2 , 

from which 
But 

dx 2 +d.y 2 +dz 2 

1 = 71*5 

dt 2 

where v is the velocity of the moving clock ; therefore 

ds 

dt' = - - = dfVO ~v 2 /c 2 ). (35.1) 

Integrating this expression, we can obtain the time interval indicated 
by the moving clock when the elapsed time according to a clock at 
rest is f 2 — ti- 

t' t -t[= I dtVV-v 2 /c 2 ). (35.2) 
'i 

The time read by a clock moving with a given object is called the 
proper time for this object. Formulae (35.1) and (35.2) express the 
proper time in terms of the time for a frame of reference from which 
the motion is observed. 
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As we see from (35.1) or (35.2), the proper time of a moving object 
is always less than the corresponding interval in the rest frame. In 
other words, the moving clock goes more slowly than the one at rest. 

Suppose a clock is moving in uniform rectilinear motion relative 
to an inertial frame K. A reference frame K' linked to this clock is 
also inertial. Then from the point of view of an observer in the frame 
K the clock in the frame K' falls behind. And conversely, from the 
point of view of the frame K\ a clock in K lags. To convince ourselves 
that there is no contradiction, let us note the following. In order to 
establish that the clock in the frame K' lags behind that in the frame K, 
we must proceed in the following fashion. Suppose that at a certain 
moment the clock in K' passes by the clock in K, and at that moment 
the readings of the two clocks coincide. To compare the rates of the 
two clocks in K and K' we must once more compare the reading of the 
same moving clock in K' with a clock in K. But now we compare this 
clock with a different clock in K — with that past which the clock in K' 
goes at another time. Then we find that the clock in K' lags behind 
this other clock in K with which it is now being compared. We see 
that to compare the rates of clocks in two reference frames we require 
several clocks in one frame and one in the other, and that therefore 
this process is not symmetric with respect to the two frames. The 
clock that appears to lag is always the one which is being compared 
with different clocks in the other frame. 

If we have two clocks, one of which describes a closed path return- 
ing to the starting point (the position of the clock which remained at 
rest), then the moving clock appears to lag relative to the one at rest. 
The converse reasoning, in which the moving clock would be consid- 
ered to be at rest (and vice versa) is now impossible, since the clock 
describing a closed trajectory does not carry out a uniform rectilinear 
motion, so that a frame of reference linked to it will not be inertial. 
Since the laws of nature are the same only for inertial reference frames, 
the frames linked to the clock at rest (inertial frame) and to the moving 
clock (non-inertial) have different properties, and the argument which 
leads to the result that the clock at rest must lag is not valid. 
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§36. The Lorentz transformation 

Our purpose is now to obtain the formulae of transformation from 
one inertial frame of reference to another, that is, formulae by means 
of which, knowing the coordinates x, y, z, t, of an event in a certain 
frame K, we can find the coordinates x', y', z', t' of the same event in 
another inertial frame K'. 

In classical mechanics this question is resolved by the simple 
formulae for a Galileo transformation (3.1), (3.2). If the frame K' 
moves relative to the frame K along the common direction of the axes 
X and X' , these formulae have the form 

x = x '+ vt', y = f, z = z', t = f. (36.1) 

This transformation, as was to be expected, does not satisfy the 
requirements of the theory of relativity; it does not leave the interval 
between events invariant. 

We shall obtain the relativistic transformations precisely as a con- 
sequence of the requirement that they leave the interval between events 
invariant. 

As we saw in §34, the interval between events can be looked on as 
the distance between the corresponding pair of world points in a four- 
dimensional system of coordinates. Consequently we may say that the 
required transformation must leave unchanged all distances in the 
four-dimensional x, y, z, ct space. But such transformations consist 
only of parallel displacements and rotations of the coordinate system. 
Of these the displacement of the coordinate system parallel to itself 
is of no interest, since it leads only to a shift in the origin of the space 
coordinates and a change in the time reference point. Thus the required 
transformation must be expressible mathematically as a rotation of 
the four-dimensional x, y, z, ct coordinate system. 

Every rotation in the four-dimensional space can be resolved into 
six rotations, in the planes xy, zy, xz, tx, ty, tz (just as every rotation in 
ordinary space can be resolved into three rotations in the planes xy, 
zy, and xz). The first three of these rotations transform only the space 
coordinates; they correspond to the usual space rotations. 
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Let us consider a rotation in the tx plane; under this, the y and z 
coordinates do not change. This transformation must leave unchanged, 
in particular, the difference (ct) 2 — x 2 , the square of the "distance" 
from the point (ct, x) to the origin. The relation between the old and 
new coordinates in this transformation is given in general form by 
the formulae 

x = x' cosh y>+ct' sinh y>, ct = x' sinh y> + ct' cosh y>, (36.2) 

where \p is the "angle of rotation " ; a simple check shows that indeed 
c 2 t 2 — x 2 = c 2 t' 2 — x' 2 . Formulae (36.2) differ from the usual formulae 
for transformation under a rotation of the axes in that trigonometric 
functions are replaced by hyperbolic functions. This is a manifestation 
of the difference between pseudo-Euclidean and Euclidean geometry. 

We try to find the formula of transformation from an inertial reference 
frame AT to a frame K' moving relative to K with velocity V along the 
x axis. In this case clearly only the coordinate x and the time t are 
subject to change. Therefore this transformation must have the form 
(36.2). Now it remains only to determine the angle y>, which can depend 
only on the relative velocity V} 

Let us consider the motion, in the frame K, of the origin of the frame 
K'. Then x' — 0 and formulae (36.2) take the form 



But x/t is clearly the velocity V of the frame K' relative to K. So 



x = ct' sinh y>, ct = ct' cosh f, 



or, dividing one by the other, 



x/ct = tanh y>. 



tanh ip = V/c. 



From this 



sinh f = 



Vjc 



cosh ip 



1 



VV-(V 2 /c 2 )] ' 



V[l-(^ 2 /c 2 )] • 



t Note to avoid confusion that we shall always use V to signify the constant 
relative velocity of two inertial frames, and v for the velocity of a moving particle, 
not necessarily constant. 
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Substituting in (36.2), we find 
_ x'+Vt' , , _ t' + (Vx'/c*) 

x ~ VV-ivVc 2 )]' y ~ y ' z_z ' V[i-(W)]" 1 ; 

This is the required transformation formula. It is called the Lorentz 
transformation, and is of fundamental importance for what follows. 

The inverse formulae, expressing x', y', z', t' in terms of x, y, z, t, are 
most easily obtained by changing V to —V (since the frame K moves 
with velocity — V relative to the frame K'). The same formulae can 
be obtained directly by solving equations (36.3) for x', y', z', t'. 

It is easy to see from (36.3) that on making the transition to the 
limit c — o= and classical mechanics, the formula for the Lorentz 
transformation actually goes over into the Galileo transformation. 

For V > c in formula (36.3) the coordinates x, / are imaginary; 
this corresponds to the fact that motion with a velocity greater than 
the velocity of light is impossible. Moreover, one cannot use a refer- 
ence frame moving with the velocity of light — in that case the denomin- 
ators in (36.3) would go to zero. 

Suppose there is a rod at rest in the frame K, parallel to the x axis. 
Let its length, measured in this frame, be Ax = x 2 — xi (x 2 and xi are 
the coordinates of the two ends of the rod in the frame K). We now 
determine the length of this rod as measured in the frame K'. To do 
this we must find the coordinates of the two ends of the rod (x' 2 and 
x{) in this frame at one and the same time t'. From (36.3) we find 

_ x[+Vt' _ x' 2 +Vt' 

Xl ~ VI i - ( v*lc 2 )] ' X2 ~ Vt i - ( ^/c 2 )] " 



The length of the rod in the frame K' is Ax' = x 2 — x[; subtracting 
xi from x 2 , we find 

Ax' 



Ax = 



V[i-(W)] 



The proper length of a rod is its length in a reference frame in which 
it is at rest. Let us denote it by lo = Ax, and the length of the rod in 
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any other reference frame K' by /. Then 

l = loVV-(V 2 lc% (36.4) 

Thus a rod has its greatest length in the reference frame in which 
it is at rest. Its length in a frame in which it moves with velocity V is 
decreased by the factor — (V 2 /c 2 )]. This result of the theory of 
relativity is called the Lorentz contraction. 

Since the transverse dimensions of a body do not change because 
of its motion, the volume <V of a body decreases according to the 
similar formula 

« = «oV[l-(m], (36.5) 

where (Vq is the proper volume of the body. 

From the Lorentz transformation we can obtain anew the results 
already known to us concerning the proper time (§35). Suppose a 
clock to be at rest in the frame K'. We take two events occurring at 
one and the same point x', y', z' in space in the frame K'. The time 
between these events in the frame K' is At' = t' 2 —t[. Now we find 
the time At which elapses between these two events in the frame K. 
From (36.3), we have 

WVx'W 4 + (Vx'[c*) 

1 v[i-(^/c 2 )]' 2 V[i-(^ 2 )]' 

or, subtracting one from the other, 

_ At' 

h h ~ x ~ vn-^ 2 )] 

in complete agreement with (35.1). 

§37. Transformation of velocities 

In the preceding section we obtained formulae which enable us to 
find, from the coordinates of an event in one reference frame, the 
coordinates of the same event in a second reference frame. Now we 
find formulae relating the velocity of a material particle in one refer- 
ence frame to its velocity in a second reference frame. 
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Let us suppose once again that the frame K' moves relative to the 
frame A' with velocity V along the x axis. Let v x = dx/dt be the compo- 
nent of the particle velocity in the frame K and v' x = dx'/dt' the velocity 
component of the same particle in the frame K'. From (36.3), we 
have 

dx'+Vdt' dt' + (Vdx'lc*) 

^VU^fiW y= y ' ' d ' = W=(W)]- 

Dividing the first three equations by the fourth and introducing the 
velocities 

dr , dr' 

v= d7' y =W 

we find 

y'x+v _ vw^-(y 2 i^)) v = vwv-(v 2 /c 2 )] 

x l+iv'tV/c*)' y 1+KK/c 2 ) ' 2 l+(v' x V/c*) ' 

(37.1) 

These formulae determine the transformation of velocities. They 
describe the law of composition of velocities in the theory of relativity. 
In the limiting case of c -»- , they go over into the formulae v x = v' x + 
+ V, v y = v' y , v 2 = v' z of classical mechanics. 

In the special case of motion of a particle parallel to the x axis, 
v x = v, v y = v 2 = 0. Then v' y = v' z = 0, v' x = v', so that 

V ' +V (37.2) 



l+(v'Vlc 2 ) 

It is easy to convince oneself that the sum of two velocities each 
smaller than or equal to the velocity of light is again not greater than 
the light velocity. 

Let us choose our coordinate axes so that the velocity of the particle 
at the given moment lies in the xy plane. Then the velocity of the 
particle in the frame K has components v x = v cos 6, v y = v sin 8, 
and in the frame K' v' x = v' cos 6', v' y = v' sin 6' (v, v', 6, 6' are the 
absolute values and the angles subtended with the x and x' axes, 
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respectively in the frames K, K'). With the help of formulae (37.1), 
we then find 

tofl = i! >fly[i-(^] . (3V . 3) 

This formula describes the change in the direction of the velocity 
on transforming from one reference frame to another. 

Let us consider a very important special case of this formula, namely, 
the deviation of light in transforming to a new reference frame — a 
phenomenon known as the aberration of light. In this case v = v' = c, 
so that the preceding formula goes over into 

tan ^-^ W. (37.4) 
K/c+cos 0' v ' 

In the case V <sc c, we find from this formula, correct to terms of 
order V/c, 

tan 6 = tanfl'O -K/c cos AO- 
Introducing the angle Ad = 6' — 0 (the aberration angle), we find 
to the same order of accuracy 

AO — ~r sm 0'> ( 37 -5) 



which is the well-known elementary formula for the aberration of 
light. 

§38. Four-vectors 

The coordinates of an event {ct, x, y, z) can be considered as the 
components of a four-dimensional position vector (or, for short, a 
position four-vector) in a four-dimensional space. We shall denote its 
components by x*, where the index /j, takes the values 0, 1,2, 3, and 

x° = ct, x 1 = x, x 2 = y, x 3 = z. 
The square of the "length" of the position four-vector is given by 



(x°) 2 -(x 1 ) 2 -(x 2 ) 2 -(x 3 ) 2 . 
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It does not change under any rotations of the four-dimensional 
coordinate system, in particular under Lorentz transformations. 

In general a set of four quantities A 0 , A 1 , A 2 , A 3 which transform 
like the components of the position four-vector^ xf under transforma- 
tions of the four-dimensional coordinate system is called a four- 
dimensional vector {four-vector) A M . Under Lorentz transformations, 

A"> + {VA*lc) A*+{VA«lc) 

VU-(W)]' VD-C^/c 2 )]' a , A 

(38.1) 

The squared magnitude of any four-vector is defined analogously to 
the square of the radius four-vector: 

(A°) 2 -(Aiy-(A*) 2 -(A 3 y. 

For convenience of notation, we introduce two "types" of components 
of four-vectors, denoting them by the symbols A" and A fl , with super- 
scripts and subscripts. These are related by 

A 0 = A 0 , A x = -A\ A 2 = —A 2 , A s = -A 3 . (38.2) 

The quantities A" are called the contravariant, and the A^ the covariant 
components of the four-vector. The square of the four-vector then 
appears in the form 

3 

I A"A M = A'Ao+A^ + A'Aa + A'Aa. 

Such sums are customarily written simply as A M A , omitting the 
summation sign. The convention is that a summation is effected over 
any index which appears twice, and the summation sign is omitted. 
Of the pair of identical indices, one must be a superscript and the 
other a subscript. This convention for summation over "dummy" 
indices is very convenient and considerably simplifies the writing of 
formulae. 



t We use Greek letters A, /u,, v, ... for four-dimensional indices, taking values 
0, 1, 2, 3. 
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In analogy to the square of a four-vector, one forms the scalar 
product of two different four-vectors: 

A"Bp = AOBo+AWi+AtBs + AiBi. 

It is clear that this can be written either as A^B^ or AJi 11 — the result 
is the same. In general one can switch upper and lower indices in any 
pair of dummy indices.* 

The product A^B^ is a four-scalar — it is invariant under rotations of 
the four-dimensional coordinate system. This is easily verified directly, 
but it is also apparent beforehand (from the analogy with the square 
A^A^ from the fact that all four-vectors transform according to the 
same rule. 

The component A 0 is called the time component, and A 1 , A 2 , A 3 the 
space components of the four-vector (in analogy to the position four- 
vector). The square of a four-vector can be positive, negative, or zero; 
such vectors are called time-like, space-like, and null vectors, respect- 
ively (again in analogy to the terminology for intervals). 

Under purely spatial rotations (i.e. transformations not affecting 
the time axis) the three space components of the four-vector A 11 form a 
three-dimensional vector A. The time component of the four-vector 
is a three-dimensional scalar (with respect to these transformations). 
In enumerating the components of a four-vector, we shall often write 
them as 

A" = (A 0 , A). 

The covariant components of the same four-vector are A M = (A 0 , —A), 
and the square of the four- vector is A^A^ = (A 0 ) 2 — A 2 . Thus, for the 
position four-vector, 

x" = (ct,t), x M = (ct,-t), x% = c*/ s -i» 

For three-dimensional vectors (with coordinates x, y, z) there is no 
need to distinguish between contra- and covariant components. 



t In the literature the indices are often omitted on four-vectors, and their 
squares and scalar products are written as A 1 , AB. We shall not use this notation in 
the present book. 
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Whenever this can be done without causing confusion, we shall write 
their components as A t (i = x, y, z) using Latin letters for subscripts. 
In particular we shall assume a summation over x, y, z for any repeated 
index (for example, A-B = Afi^. 

A four-dimensional tensor {four-tensor) of rank two is a set of sixteen 
quantities A 1 "', which under coordinate transformations transform 
like the products of components of two four-vectors. We similarly 
define four- tensors of higher rank. 

The components of a four-tensor of rank two can be written in three 
forms: covariant, A^, contravariant, A''", and mixed, A'\ (where, in 
the last case, one should in general distinguish between A f ' r and A J, 
i.e. one should be careful about whether the first or the second 
is the subscript). The connection between the different types of 
components is determined from the general rule : raising or lowering a 
space index (1, 2, 3) changes the sign of the component, while raising 
or lowering the time index (0) does not. Thus: 

A 00 = A 00 , Aoi = -A 01 , Au = A 11 , 
A 0 ° = A 00 , AJ = A 01 , A\ = -A 01 , A* = —A 11 , .... 

Under purely spatial transformations, the nine quantities A 11 , A 12 , 
. . . form a three-dimensional tensor. The three components A 01 , A 02 , 
A 03 and the three components A 10 , A 20 , A 30 constitute three-dimensional 
vectors, while the component A 00 is a three-dimensional scalar. 

A tensor A 1 " is said to be symmetric if A 1 " = A" 1 ', and antisymmetric 
if A 1 " = — A" 1 . In an antisymmetric tensor, all the diagonal components 
(i.e. the components A 00 , A 11 , . . .) are zero, since, for example, we 
must have A 00 = — A 00 . For a symmetric tensor A'", the mixed 
components A f ' r and A J 1 obviously coincide; in such cases we shall 
simply write A^, putting the indices one above the other. 

In every tensor equation, the two sides must contain identical and 
identically placed (i.e. above or below) free indices (as distinguished 
from dummy indices). The free indices in tensor equations can be 
shifted up or down, but this must be done simultaneously in all terms 
in the equation. Equating covariant and contravariant components of 
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different tensors is "illegal"; such an equation, even if it happened by 
chance to be valid in a particular frame of reference, would be violated 
on going to another frame. 

From the tensor components A"" one can form a scalar by taking 
the sum 

A^p = A°q-\- A l ±-\- A^2~\~ A 3 $ 

(where, of course, A" fl = A*}. This sum is called the trace of the tensor, 
and the operation for obtaining it is called contraction. 

The formation of the scalar product of two four-vectors, considered 
earlier, is a contraction operation: it is the formation of the scalar 
A"B /i from the tensor A"B r . In general, contracting on any pair of 
indices reduces the rank of the tensor by 2. For example, A" v)itl is a 
tensor of rank two, A",!}" is a four-vector, A""^ is a scalar, etc. 

The unit four-tensor bf v , satisfies the condition that, for any four- 
vector A", 

b^A* = A'. (38.3) 
It is clear that the components of this tensor are 

if fi = v, 



* to, 



• f ^ (38.4) 

if fi ^ v. 

Its trace is = 4. 

By raising the one index or lowering the other in 6^, we can obtain 
the contra- or covariant tensor gf" or g^, which is called the metric 
tensor. The tensors gf" and g^ have identical components, which can 
be written as a matrix: 




(the index p labels the rows, and v the columns, in the order 0, 1, 2, 3). 
It is clear that 

g Mr A' = A m g"A, = A", (38.6) 

and this is the tensor form of the operation of lowering or raising 
indices. 
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Finally we consider some differential and integral operations of 
four-dimensional tensor analysis. 
The four-gradient of a scalar 4> is the four-vector 



Note that the derivatives should be legarded as the covariant com- 
ponents of the four-vector. For the differential of a scalar 



is also a scalar; from its form (scalar product of two four-vectors) our 
statement is clear. 

In general, operators of differentiation with respect to the coordi- 
nates y, 3/3x", should be regarded as the covariant components of an 
operator four-vector. Thus the divergence of a four-vector, the 
expression dA^jdx", in which the contravariant components of the 
four-vector A M are differentiated, is a scalar. 

In three-dimensional space we can integrate over a volume, a 
surface, or a curve. There are four corresponding types of integrations 
in four-dimensional space: (1) over a curve in four-space, (2) over a 
(two-dimensional) surface, (3) over a hypersurface, i.e. a three-dimen- 
sional manifold, (4) over a four-dimensional volume. 

In analogy to the theorems of Gauss and Stokes in three-dimensional 
vector analysis, there are theorems that enable one to transform four- 
dimensional integrals into one another. Of these we will need only the 
theorem for transforming an integral over a four-volume into an 
integral over a hypersurface. 

The element of integration over a four-volume 



is a scalar; this is obvious from a comparison of the laws of transforma- 
tion of time intervals (35.1) and of spatial volumes (36.5). The element 
of integration over a hypersurface dS 1 " is a four-vector equal in magni- 




dQ = dx° dx 1 dx 2 dx 3 = c dt dV 



(38.7) 
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tude to the "area" of the hypersurface element and directed along the 
normal to the element (thus, the component dS° = dx dy dz, i.e. it is 
the three-dimensional volume element dV, the projection of the hyper- 
surface element on the hyperplane x° = constant). 

The integral over a closed hypersurface can be transformed into an 
integral over the four-volume it encloses, by replacing the element of 
integration dS M by an operator : 




(38.8) 



For example, for the integral of a vector A M we have 




(38.9) 



This formula is the generalisation of Gauss' theorem. 
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RELATIVISTIC MECHANICS 

§39. Energy and momentum 

Just as in classical mechanics, we shall start from the principle of 
least action in order to derive the relativistic equations of motion of 
particles. Let us start by finding the action integral for a free particle. 

This integral must not depend on our particular choice of inertial 
reference frame, i.e. it must be invariant under Lorentz transforma- 
tions. It follows that it must depend on a scalar. The integrand must be 
a first-order differential. But the only scalar of this kind that one can 
construct for a free particle is the interval ds, or ds multiplied by a 
constant characterising the particle. We shall denote this constant by 
— mc; the reason for this will be made clear in what follows. 

So for a free particle the action must have the form 

b 

S=-mc$ds, (39.1) 

a 

b 

where J is an integral along the world line of the particle between two 

a 

events — the presence of the particle at its initial and final positions at 
times h and ti. 

By using (35.1) we can rewrite this expression as an integral over the 
time: 

'2 

S = - mc* J Vtl ~ (fl 2 /c 2 )] dt. 

h 
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Comparing this with the general definition (2.1) 

5 = j Ldt, 

h 

we see that the relativistic Lagrangian for a free particle is 

L = -mcV[l-(» 2 /c 2 )]. (39.2) 

For low velocities, in the non-relativistic limit, we can expand L in 
powers of v/c. Omitting higher-order terms, we get 

L = — mc 2 + Ymv 2 . 

A constant term in the Lagrangian has no effect on the equations of 
motion, and can be dropped. We then get back the classical expression 
L = \mv 2 . We also see the significance of the constant m introduced 
in (39.1), which coincides with the mass of the particle. 

By the momentum of a particle we mean the vector p = 8L/3v. 
Differentiating (39.2), we find 

my 

P ~ VV-(v 2 lc 2 )] ' (39,3) 

For small velocities (v <k c) this expression goes over into the classical 

p = my. 

The time derivative of the momentum is the force acting on the 
particle. Suppose the velocity of the particle changes only in direction, 
that is, suppose the force is directed perpendicular to the velocity. Then 

dp m dv 



dt Vn-C^/c 2 )] dt 



(39.4) 



If the velocity changes only in magnitude, that is, if the force is parallel 
to the velocity, then 

dp m dv 



dt [1 -(v 2 /c 2 )] 3 ' 2 dt' 



(39.5) 



We see that the ratio of force to acceleration is different in the two 
cases. 
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According to the general definition (6.1), the energy E of the 
particle 

E=f\-L. (39.6) 
Substituting the expressions (39.2) and (39.3) for L and p, we find 

tnc 2 

£= VU-(»W (39-7) 

This very important formula shows, in particular, that in relativistic 
mechanics the energy of a free particle does not go to zero for v = 0, 
but rather takes on a finite value 

E = mc 2 . (39.8) 

This quantity is called the rest energy of the particle. 

For small velocities (v/c <k 1), we have, expanding (39.7) in series 
in powers of v/c, 

E % mc 2 +\mv 2 , 

which, except for the rest energy, is the classical expression for the 
kinetic energy of a particle. 

We emphasise that, although we speak of a "particle", we have 
nowhere made use of the fact that it is "elementary". Thus the for- 
mulae are equally applicable to any composite body consisting of 
many particles, where by m we mean the total mass of the body and by 
v the velocity of its motion as a whole. In particular, formula (39.8) 
is valid for any body which is at rest as a whole. We call attention to 
the fact that in relativistic mechanics the energy of a free body (i.e. 
the energy of any closed system) is a completely definite quantity 
which is always positive and is directly related to the mass of the body. 
In this connection we recall that in classical mechanics the energy of a 
body is defined only to within an arbitrary additive constant, and can 
be either positive or negative. 

The energy of a body at rest contains, in addition to the rest energies 
of its constituent particles, the kinetic energy of the particles and the 
energy of their interactions with one another. In other words, mc 2 
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is not equal to Zm a c 2 (where m a are the masses of the particles), 
and so m is not equal to £m a . Thus in relativistic mechanics the law of 
conservation of mass does not hold : the mass of a composite body is 
not equal to the sum of the masses of its parts. Instead only the law 
of conservation of energy, in which the rest energies of the particles 
are included, is valid. 

Squaring (39.3) and (39.7) and comparing the results, we get the 
following relation between the energy and momentum of a particle: 

£2/ c 2 = pt+mW. (39.9) 

The energy expressed in terms of the momentum is called the Hamilto- 
nian function H: 

H = cV0> 2 +mV). (39.10) 

For low velocities, p mc, and we have approximately 

H % mc 2 + (p 2 /2m), 

i.e., except for the rest energy we get the familiar classical expression 
for the Hamiltonian. 

From (39.3) and (39.7) we get the following relation between the 
energy, momentum, and velocity of a free particle : 

p = £v/c 2 . (39.11) 

For v = c, the momentum and energy of the particle become in- 
finite. This means that a particle with mass m different from zero cannot 
move with the velocity of light. Nevertheless, in relativistic mechanics, 
particles of zero mass moving with the velocity of light can exist.* 
From (39.11) we have for such particles: 

p = E/c. (39.12) 

The same formula also holds approximately for particles with non- 
zero mass in the ultra-relativistic case, when the particle energy E is 
large compared with its rest energy mc 2 . 



t Examples are light quanta and neutrinos. 
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§40. Four-momentum 

The arguments of the previous section still leave open the question 
of the transformation of the energy and momentum of a particle when 
we change from one reference frame to another. To answer this 
question we must make clear the four-dimensional nature of these 
quantities. 

From the usual three-dimensional velocity vector v of a particle 
we can also form a four-vector. This velocity four-vector is 

u" = <bc*/dj. (40.1) 

Expressing the element of interval dj in terms of the time differential 
dt according to (35.1) we can write 



From this we see that the components of this four-vector are 



\VV-(.v 2 /c 2 )V cV[\-(v 2 /<?)] 

The components of the four-velocity are not independent. Noting 
that dx u dx" = ds 2 , we find 

( 40 - 3 ) 

From the geometrical point of view vt is a unit four-vector tangent to 
the world line of the particle. 

The four-momentum of a particle is the four-vector 

p" = mcu". (40.4) 

Taking the components of the four-velocity from (40.2) and comparing 
with the expressions (39.3) and (39.7), we see that the components of 
the four-momentum are 

p» = (Efc, p). (40.5) 



(40.2) 
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Thus in relativistic mechanics momentum and energy are components 
of a single four-vector. From this we immediately get the formulae for 
transformation of these quantities. Substituting the expressions (40.5) 
in the general law (38.1) for transformation of four-vectors, we find 

_ p'tHVE'/c*) E'+ Vp' x 

Px y/[l -(KVc 2 )] ' P > P} " Pz P " V[l -(*"/«*)] ' 

(40.6) 

where p x , p y , p z are the components of the three-momentum p. 

From the definition (40.4) of the four-momentum and the identity 
(40.3) we have for the square of the four-momentum of a free particle 

P/l p" = m i(?. (40.7) 

Substituting the components pf from (40.5), we arrive at the relation 
(39.9). 

§41. Decay of particles 

Let us consider the spontaneous decay of a body of mass M into 
two parts with masses m\ and w 2 . The law of conservation of energy 
in the decay, applied in the frame of reference in which the body is at 
rest, gives* 

M = E 10 +E 20 , (41.1) 

where Eh> and £20 are the energies of the emerging particles. Since 
E10 > mi and £20 > m 2 , the equality (41.1) can be satisfied only if 
M > m 1 +m2, i.e. a body can disintegrate spontaneously into parts the 
sum of whose masses is less than the mass of the body. On the other 



t In §§41, 42 we put c = 1. In other words the velocity of light is taken as the 
unit of velocity (so that the dimensions of length and time become the same). This 
choice is a natural one in relativistic mechanics and greatly simplifies the writing 
of formulae. However, in this book (which also contains a considerable amount 
of non-relativistic theory) we shall not usually use this system of units, and will 
explicitly indicate when we do. 

If c has been put equal to unity in formulae, it is easy to convert back to ordinary 
units: the velocity of light is introduced to assure correct dimensions. 
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hand, if M -= mi+ m 2 , the body is stable (with respect to the particular 
decay) and does not decay spontaneously. To cause the decay in this 
case, we would have to supply to the body from outside an amount of 
energy at least equal to its binding energy (mi+m 2 —M). 

Momentum as well as energy must be conserved in the decay 
process. Since the initial momentum of the body was zero, the sum of 
the momenta of the emerging particles must be zero: P10+P20 = 0. 
Consequently p\ 0 = p\ 0 , or 

E^~m\ = E^-m\. (41.2) 



The two equations (41.1) and (41.2) uniquely determine the energies 
of the emerging particles: 

El0 = 2M ' E2 °- 2M • (41J) 

In a certain sense the inverse of this problem is the calculation of the 
total energy M of two colliding particles in the frame of reference in 
which their total momentum is zero (the centre-of-mass system or 
C system). The computation of this quantity gives a criterion for the 
possible occurrence of various inelastic collision processes, accompa- 
nied by a change in state of the colliding particles, or the "creation" of 
new particles. A process of this type can occur only if the sum of the 
masses of the "reaction products" does not exceed M. 

Suppose that in the initial reference frame (the laboratory system or 
L system) a particle with mass mi and energy E\ collides with a particle 
of mass m 2 which is at rest. The total energy of the two particles is 

E = E!+E 2 = Ei + m 2 , 

and their total momentum is p = P1+P2 = Pi- Considering the two 
particles together as a single composite system, we find the velocity of 
its motion as a whole from (39.11): 
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This quantity is the velocity of the C system with respect to the L 
system. 

However, in determining the mass M, there is no need to transform 
from one reference frame to the other. Instead we can make direct 
use of formula (39.9), which is applicable to the composite system just 
as it is to each particle individually. We thus have 

from which 

M 2 = m\+ml+2m 2 Ei . (41.5) 
PROBLEM 

Determine the maximum energy which can be carried off by one of the decay 
particles, when a particle of mass M at rest decays into three particles with masses 
m l7 m 2 and m 3 . 

Solution. The particle m 1 has its maximum energy if the system of the other two 
particles m 2 and m 3 has the least possible mass ; the latter is equal to the sum m 2 + m 3 
(and corresponds to the case where the two particles move together with the same 
velocity). Having thus reduced the problem to the decay of a body into two parts 
we obtain from (41.3) 

M'+mj-^+m,) 8 
'"" 2M 



§42. Elastic collisions of particles 

Let us consider, from the point of view of relativistic mechanics, the 
elastic collision of particles. We denote the momenta and energies of 
the two colliding particles (with masses mi and m 2 ) by pi, Ei and p 2 , 
E 2 ; we use primes for the corresponding quantities after collision. 
The laws of conservation of momentum and energy in the collision 
can be written together as the equation for conservation of the four- 
momentum : 

P"i+P$ = P'f+P'f. (42.1) 

From this four-vector equation we construct invariant relations which 
will be helpful in further computations. To do this we rewrite (42.1) 
in the form 
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and square both sides (i.e. we write the scalar product of each side with 
itself). Noting that the squares of the four-momenta p'l and p'f are 
equal to ml, and the squares of p\ and p'£ are equal to m\, we get 

ml + PlM pZ -p^p? -p^? = 0. (42.2) 

Similarly, squaring the equation p\-Vp%—p' 2 — p'f, we get 

"i+p^-PuPf-PivP? = 0. (42.3) 

Let us consider the collision in a reference frame (the L system) in 
which one of the particles (m 2 ) was at rest before the collision. Then 
P2 = 0, E 2 = m 2 , and the scalar products appearing in (42.2) are 



PimP2 — E l m 2> 
p^p'x = "hE[, 

PvxPi = EiE[ -Pi -Pi = -p^ cos 6 b 



(42.4) 



where %\ is the angle of scattering of the incident particle mi. Substitut- 
ing these expressions in (42.2), we get 

oose^^^-^-^ . (42.5) 
PiPi 

Similarly, we find from (42.3) 

cos6 2 = {El+nhd{E r nh) , (42.6) 
P1P2 

where 6 2 is the angle between the transferred momentum p' 2 and the 
momentum of the incident particle pi. These formulae relate the angles 
of scattering of the two particles in the L system to the changes in 
their energy in the collision. 

We note that if mi =- m 2 , i.e. if the incident particle is heavier than 
the target particle, the scattering angle 61 cannot exceed a certain 
maximum value. It is easy to find by elementary computations that 
this value is given by the equation 

sin 6 lmax = m 2 /mi, (42.7) 



which coincides with the familiar classical result (14.8). 
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Formulae (42.5), (42.6) simplify in the case when the incident particle 
has zero mass : m 1 = 0, and correspondingly p 1 = E v p\ = E' v For 
this case let us write the formula for the energy of the incident particle 
after the collision, expressed in terms of its angle of deflection: 

£' _ ^ 8) 

1 l-cosd 1 + m 2 /E 1 ' K ' ' 

Let us now turn once again to the general case of collision of 
particles of arbitrary mass. The collision is most simply treated in the 
C system. Designating quantities in this system by the additional 
subscript 0, we have pio = — P20 = Po- From the conservation of 
momentum, during the collision the momenta of the two particles 
merely rotate, remaining equal in magnitude and opposite in direction. 
From the conservation of energy, the value of each of the momenta 
remains unchanged. 

Let % be the angle of scattering in the C system — the angle through 
which the momenta pi 0 and P20 are rotated by the collision. This 
quantity completely determines the scattering process in the C system, 
and therefore also in any other frame of reference. It is also convenient 
in describing the collision in the L system and serves as the single 
parameter which remains undetermined after the laws of conservation 
of momentum and energy are applied. 

We express the final energies of the two particles in the L system in 
terms of this parameter. To do this we return to (42.2), but this time 
write out the product p^p'i in the C system : 

PvPi = -Eio^io-Pio-Pio = £10 -Po cos X = />o(l -cos x) + ™\ 

(in the C system the energies of the particles do not change in the 
collision : E' 10 = E 10 ). We write out the other two products in the 
L system, i.e. we use (42.4). As a result we get E[— E 1 = —(pl/m 2 )X 
X (1 — cos %). We must still express pi in terms of quantities referring to 
the L system. This is easily done by equating the values of the in- 
variant p^pZ in the L and C systems: 



£10^20— Pio*P2o = Eim 2 , 
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or 

VKti + nftiPo + n®] = Ei m 2-Po- 
Solving the equation for p% we get 

_ mKEf-mf) 
r °- ml + ml + lm.E, ' K ™ } 

Also using the conservation law E 1 +m 2 = is^+i^, we finally get 



m 2 (El-m\) 
ml+ml + lniiEx 



^-ff (l-cosg). (42.10) 



This expression represents the energy lost by the first particle and 

transferred to the second particle. The maximum energy transfer 

occurs for %=n, and is equal to 

p< m _ F F . _ 2m 2 (Ef-ml) 

E 2 max ~m 2 min - ml + ml + 2m2Ei ■ (42. 1 1) 

The ratio of the minimum kinetic energy of the incident particle 
after collision to its initial kinetic energy is : 



E 1 — m 1 mf+ml+ 2m 2 E 1 

In the limiting case of low velocities (when E ^ m+^mv 1 ), this ratio 
tends to a constant limit, equal to 



/ m 1 -m 2 V 2 
\ m 1 + m 2 ) 



In the opposite limit of large energies Ei, the ratio (42.12) tends to 
zero ; the quantity E[ min tends to a constant limit. This limit is 

ml+ml 



l min 



2m i 



Let us assume that m 2 » mi, i.e. the mass of the incident particle 
is small compared with the mass of the particle at rest. According to 
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classical mechanics the light particle could transfer only a negligible 
part of its energy (see § 14). This is not the case in relativistic mechanics. 
From formula (42.12) we see that for sufficiently large energies Ey 
the fraction of the energy transferred can reach the order of unity. For 
this it is not sufficient that the velocity of my be of order 1 , but one 
must have Ey ~ m 2 , i.e. the light particle must have an energy of the 
order of the rest energy of the heavy particle. 

A similar situation occurs for m 2 <c my, i.e. when a heavy particle 
is incident on a light one. Here too, according to classical mechanics, 
the energy transfer would be insignificant. The fraction of the energy 
transferred begins to be significant only for energies E x ~ m\\m 2 . 
We note that we are not talking simply of velocities of the order of the 
light velocity, but of energies large compared with my, i.e. we are 
dealing with the ultra-relativistic case. 



PROBLEMS 

Problem 1. For two particles of equal mass, find the separation angle after 
collision, in the laboratory system. 

Solution. Taking the squares of both sides of (42.1), we get 

and, after writing out the products of the four-momenta in the L system, 

£ , 1 m 2 = E[E' 2 — PiP' 2 cos®, 

where & is the separation angle (the angle between p[ and pj). For particles of 
equal mass (m 1 = m 2 = m), we substitute 

p[ = VW-m*), p' 2 = vW-m*). 

and using the conservation of energy (E l + m = E'\+E£) we obtain 

'{E[-m){E' z -m) 



cos © 



+ m)(E'+m) ' 



The angle 0 varies from \n (for E[ -» m or E^ -» m) to a minimum value @ m i„, 
which is reached when E[ = E' t : 
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Problem 2. For the collision of two particles of equal mass m, express E' u Ei 
and x in terms of the scattering angle 9 t in the laboratory system. 

Solution. Substituting/^ = V(El-m 2 ),pi = V(£i' ! - »» 2 ) in (42.5) and solving 

the equation for £7, we find 

„, = £i + w+(£i-w)cos 2 0 t 
1 ~ E l + m-(E 1 -m) cos 2 0 t 

and so 

(E?-m 2 )sin 2 0, 



i?2 = Ei + m-E,' = m + - 



2m + (E l -m) sin 2 0, 
Comparing with the expression for Ei in terms of 

E 'i = £,-K£i-m)(l-cos x) 
(from (42.10)), we find the angle of scattering in the centre-of-mass system: 

2m-(£! + 3m)sin 2 0j 



cos x 



2m+(E I +m)sin 2 0 I 
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CHARGES IN ELECTROMAGNETIC 
FIELDS 

§43. Four-potential of a field 

The interaction of particles can be described with the help of the 
concept of a field of force. Namely, instead of saying that one particle 
acts on another, we may say that the particle creates a field around 
itself ; a certain force then acts on every other particle located in this 
field. In classical mechanics, the field is merely a mode of description 
of the physical phenomenon — the interaction of particles. In the theory 
of relativity, because of the finite velocity of propagation of interactions, 
the situation is changed fundamentally. The forces acting on a particle 
at a given moment are not determined by the positions at that same 
moment. A change in the position of one of the particles influences 
other particles only after the lapse of a certain time interval. This means 
that the field itself acquires physical reality. We cannot speak of a 
direct interaction of particles located at a distance from one another. 
Interactions can occur at any one moment only between neighbouring 
points in space (contact interaction). Therefore we must speak of the 
interaction of the one particle with the field, and of the subsequent 
interaction of the field with the second particle. 

The second part of this book is devoted to the theory of electro- 
magnetic fields. We start by studying the interaction of a particle with 
a given field. 

For a particle moving in a given electromagnetic field, the action is 
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made up of two parts : the action (39.1) for the free particle, and a term 
describing the interaction of the particle with the field. The latter term 
must contain quantities characterising the particle and quantities 
characterising the field. 

It turns out t that the properties of a particle with respect to inter- 
action with the electromagnetic field are determined by a single 
parameter — the charge e of the particle, which can be either positive or 
negative (or equal to zero). The properties of the field are characterised 
by a four-vector the four-potential, whose components are functions 
of the coordinates and time. These quantities appear in the action 
function in the term 

b 

6 A^dx", 



where the functions A M are taken at points on the world line of the 
particle. The factor 1/c has been introduced for convenience. It should 
be pointed out that, so long as we have no formulae relating the 
charge or the potentials with already known quantities, the units for 
measuring these new quantities can be chosen arbitrarily. (We shall 
return to this question in §53.) 

Thus the action function for a charge in an electromagnetic field 
has the form 

b 

s = 



mcds-—A„dxfy (43.1) 



t The assertions which follow should be regarded as being essentially the con- 
sequence of experimental data. The form of the action for a particle in an electro- 
magnetic field cannot be fixed on the basis of general considerations alone, such 
as, for example, the requirement of relativistic invariance. The latter would permit, 
for example, the occurrence in formula (43.1) of a term of the form J" A ds, where 
A is a scalar function. 

To avoid any misunderstanding, we repeat that we are considering classical 
(and not quantum) theory, and therefore do not include effects which are related 
to the spins of particles. 
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The three space components of the four-vector A M form a three- 
dimensional vector A called the vector potential of the field. The time 
component is called the scalar potential; we denote it by A 0 = 
Thus 

A" = (4, A). (43.2) 
Therefore the action integral can be written in the form 

^ — mcds+^A-di— e(j> dfj . 



b 

s = 



Introducing drjdt = v, the particle velocity, and changing to an 
integration over t, 

,S = | ^-m<*^[l-(v*/<*)]+jA>'r-e<l^ dt. (43.3) 



The integrand is just the Lagrangian for a charge in an electromagnetic 
field: 

L = -fficVll -(iWl+^A-v-etf.. (43.4) 



This function differs from the Lagrangian for a free particle by the 
terms (e/c)A-y—e(j), which describe the interaction of the charge with 
the field. 

The derivative dL/dy is the generalised momentum of the particle ; 
we denote it by P : 

P = /n Tiim +- A = P+— A. (43.5) 
y[l — (^/c 2 )] c c 

Here we have denoted by p the ordinary momentum of the particle, 
which we shall refer to simply as its momentum. 
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From the Lagrangian we can find the Hamiltonian function for a 
particle in a field from the general formula* 



dL 

v " "a L - 

8v 



Substituting (43.4), we get 



^vuhW*- (43 ' 6) 

However, the Hamiltonian must be expressed not in terms of the 
velocity, but rather in terms of the generalised momentum of the 
particle. From (43.5) and (43.6) it is clear that the relation between 
76— e<f) and P — (e/c)A is the same as the relation between 76 and p in 
the absence of the field, i.e. 

(^^) 2 = ^+(p-|a) 2 . (43.7) 

For low velocities, i.e. for classical mechanics, the Lagrangian (43.4) 
goes over into 

L = i/m> 2 +-^A-v-e</>. (43.8) 



In this approximation 



e 

p = ms = P A, 

c 



and we find the following expression for the Hamiltonian : 



§44. Equations of motion of a charge in a field 

A charge located in a field not only is subjected to a force exerted 
by the field, but also in turn acts on the field, changing it. However, 
if the charge e is not large, the action of the charge on the field can be 

t In this part of the book we shall use script letters & and 38 (instead of £ and H) 
for the energy and the Hamiltonian, in order to avoid confusion with the field 
Strengths, 
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neglected. * In this case, when considering the motion of the charge 
in a given field, we may assume that the field itself does not depend on 
the coordinates or the velocity of the charge. 

The equations of motion of a charge in a given electromagnetic 
field are given by the Lagrange equation 



~dt \ ~ 



5M £• 



where L is given by formula (43.4). 

The derivative dL/dv is the generalised momentum of the particle 
(43.5). Further, we write 

= vL — — grad A-v— egrad S. 
9r c 

But from a formula of vector analysis 

grad(a-b) = (a-v)b+(b-v)a+bXcurla+axcurlb, 

where a and b are two arbitrary vectors. Applying this formula to 
A-v, and remembering that differentiation with respect to r is carried 
out for constant v, we find 

e , . c . . . , 

= — (v-v)Ah — vXcurl A — egrad </». 
or c c 

So the Lagrange equation has the form 

d / e \ e e 
lit v~*~~c^) = ~c^ v '^^^"c v -^ cur ^ — e S ra< ^ 4>- 

But the total differential (dA/df) df consists of two parts: the change 
(dA/dt)dt of the vector potential with time at a fixed point in space, 
and the change due to motion from one point in space to another at 



t The condition for smallness of the charge in this sense is that the radiation 
damping forces arising from its motion (cf. the discussion in §84) be small. 
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distance dr. This second part is equal to (dr-v)A. Thus 

dA 9A , , ^* 
_ = _ + (v . v )A. 

Substituting this in the previous equation, we find 

^5- = _i.-|i_egrad0+^-vXcurlA. [(44.2) 

This is the equation of motion of a particle in an electromagnetic 
field. On the left-hand side stands the derivative of the particle's 
momentum with respect to the time. Therefore the expression on the 
right of (44.2) is the force exerted on the charge in an electromagnetic 
field. We see that this force consists of two parts. The first part (first 
and second terms on the right-hand side of (44.2)) does not depend on 
the velocity of the particle. The second part (third term) depends on the 
velocity, being proportional to the velocity and perpendicular to it. 

The force of the first type, per unit charge, is called the electric 
field intensity; we denote it by E. So, by definition, 

E = ---^-gradtf.. (44.3) 
c at 

The coefficient of v/c in the force of the second type, per unit charge, 
is called the magnetic field intensity. We designate it by H. So, by 
definition, 

H = curl A. (44.4) 

The equation of motion of a charge in an electromagnetic field can 
now be written as 

d P = eE+-vXH. (44.5) 
at c 



The expression on the right is called the Lorentz force. The first term 
(the force which the electric field exerts on the charge) does not depend 
on the velocity of the charge, and is along the direction of E. The second 
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part (the force exerted by the magnetic field on the charge) is pro- 
portional to the velocity of the charge and is directed perpendicular 
to the velocity and to the magnetic field H. 

For velocities small compared with the velocity of light, the momen- 
tum p is approximately equal to its classical expression mv, and 
the equation of motion (44.5) becomes 

m~ = eE + -vXH. (44.6) 
at c 

Next we calculate the rate of change of the kinetic energy of the 
particle* with time, i.e. the derivative 

dS kiD _d_ / mc 1 \ 
dt dt\ <v/[l-(u 2 /c 2 )] /' 

It is easy to check that 

dlkin dp 
dt dt ' 



Substituting dp/df from (44.5) and noting that vXH-v = 0, we have 

= cE-T. (44.7) 



The expression on the right-hand side is the work done by the field 
on the particle per unit time. Work is done on the charge only by the 
electric field ; the magnetic field does no work on a charge moving in 
it, since the force which the magnetic field exerts on a charge is always 
perpendicular to the velocity of the charge. 

In §5 it was noted that the equations of classical mechanics are 
invariant with respect to a change in sign of the time. It is easy to see 
that this is also valid for the electromagnetic field in the theory of 
relativity. In this case, however, in addition to changing t into — t, 
we must reverse the sign of the magnetic field. It is easy to see that the 



t By "kinetic" we mean the energy (39.7), which includes the rest energy. 
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equations of motion (44.5) are not altered if we make the changes 

t-*-t, E-E, H--H. (44.8) 

According to (44.3) and (44.4), this does not change the scalar 
potential, while the vector potential changes sign : 

4 -+ 4, A - -A, (44.9) 

Thus, if a certain motion is possible in an electromagnetic field, then 
the reversed motion is possible in a field in which the direction of H is 
reversed. 

PROBLEM 

Express the acceleration of a particle in terms of its velocity and the electric 
and magnetic field intensities. 

Solution. Substitute in the equation of motion (44.5) p = v<5 k i n /c 2 , and take 
the expression for d£ kl Jdt from (44.7). The result is 

v = ~ V[l-(f 2 /c ! )] |e+~vXH- ^-v(v-E)J . 

§45. Gauge invariance 

Let us consider to what extent the potentials are uniquely determined. 
The field is characterised by the effect which it produces on the motion 
of a charge located in it. But in the equation of motion (44.5) there 
appear not the potentials, but the field intensities E and H. Therefore 
two fields are physically identical if they are characterised by the same 
vectors E and H. 

If we are given potentials A and </>, then these uniquely determine 
(according to (44.3) and (44.4)) the fields E and H. However, to one 
and the same field there can correspond different potentials. To show 
this, let us add to each component of the four-potential the quantity 
— dfjdx^, where / is an arbitrary function of the coordinates and the 
time. Then the potential A M goes over into 



(45.1) 
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As a result of this change there appears in the action integral (43.1) 
the additional term 




(45.2) 



which is a total differential and has no effect on the equations of 
motion (cf. §2). 

If in place of the four-potential we introduce the scalar and vector 
potentials, and in place of xf, the coordinates ct, x, y, z, then the four 
equations (45.1) can be written in the form 

A' = A+grad/, <j>' = (j> ~ -£ . (45.3) 

It is easy to check that electric and magnetic fields determined from 
equations (44.3) and (44.4) actually do not change upon replacement 
of A and <j> by A' and <j)'. Thus the transformation (45.3) does not 
change the fields. The potentials are therefore not uniquely defined; 
the vector potential is determined to within the gradient of an arbitrary 
function, and the scalar potential to within the time derivative of the 
same function. 

In particular, we can add an arbitrary constant vector to the vector 
potential, and an arbitrary constant to the scalar potential. This is 
also clear directly from the fact that the definitions of E and H contain 
only derivatives of A and <j), and therefore the addition of constants to 
the latter does not affect the field intensities. 

Only those quantities have physical meaning which are invariant 
with respect to the transformation (45.3) of the potentials ; in particular 
all equations must be invariant under this transformation. This in- 
variance is called gauge invariance. 1 ' 

This non-uniqueness of the potentials gives us the possibility of 
choosing them so that they fulfil one auxiliary condition chosen by us. 
We emphasise that we can set one condition, since we may choose the 



t We emphasise that this is related to the assumed constancy of e in (45.2). 
Thus the gauge invariance of the equations of electrodynamics and the conservation 
of charge are closely related to one another. 
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function / in (45.3) arbitrarily. In particular, it is always possible to 
choose the potentials so that <p is zero. If the vector potential is not 
zero, it is not generally possible to make it zero, since the condition 
A = 0 represents three auxiliary conditions (for the three components 



By a constant electromagnetic field we mean a field which does not 
depend on the time. Clearly the potentials of a constant field can be 
chosen so that they are functions only of the coordinates and not of the 
time. A constant magnetic field is equal, as before, to H = curl A. 
A constant electric field is equal to 



Thus a constant electric field is determined only by the scalar potential 
and a constant magnetic field only by the vector potential. 

We saw in the preceding section that the potentials are not uniquely 
determined. However, if we describe the constant electromagnetic 
field in terms of potentials which do not depend on the time, then we 
can add to the scalar potential, without changing the fields, only an 
arbitrary constant (not depending on either the coordinates or the 
time). Usually <j> is subjected to the additional requirement that it 
be zero at infinity. Thus the arbitrary constant previously mentioned is 
determined, and the scalar potential of the constant field is thus deter- 
mined uniquely. On the other hand, the vector potential is not uniquely 
determined : we can add to it the gradient of an arbitrary function of 
the coordinates. 

We now determine the energy of a charge in a constant electro- 
magnetic field. If the field is constant, then the Lagrangian for the 
charge also does not depend explicitly on the time. In this case the 
energy is conserved and coincides with the Hamiltonian. According to 
(43.6), we have 



of A). 



§46. Constant electromagnetic field 



E = — grad <j>. 



(46.1) 



6 = 



mc 2 



+e<j>. 



(46.2) 



V[l -&!<*)] 
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Thus the presence of the field adds to the energy of the particle 
the term e<j>, the potential energy of the charge in the field. We note the 
important fact that the energy depends only on the scalar and not on 
the vector potential. This means that the magnetic field does not affect 
the energy of the charge. Only the electric field can change the energy 
of the particle. This is related to the fact already mentioned that the 
magnetic field, unlike the electric field, does no work on the charge. 

If the field intensities are the same at all points in space, then the 
field is said to be uniform. The scalar potential of a uniform electric 
field can be expressed in terms of the field intensity as 

<£ = -E-r; (46.3) 

since E = constant, 

v(E-r) = (E-v)r = E. 

The vector potential of a uniform magnetic field can be expressed 
in terms of its field intensity as 

A = |HXr; (46.4) 

for H = constant, we obtain with the aid of well-known formulae of 
vector analysis 

curl(HXr) = Hdivr-(H-v)r = 2H 
(noting that div r = 3). 

§47. Motion in a constant uniform electric field 

Let us consider the motion of a charge e in a uniform constant 
electric field E. We take the direction of the field as the x axis. The 
motion will obviously proceed in a plane, which we choose as the xy 
plane. Then the equations of motion (44.5) become 

p x = eE, p y = 0, 

so that 

p x = eEt, p y =p 0 . (47.1) 

The time reference point has been chosen at the moment when p x = 0; 
Po is the momentum of the particle at that moment. 
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The kinetic energy of the particle (the energy omitting the potential 
energy in the field) is c^kin — c -y/(w 2 c 2 +/? 2 ). Substituting (47.1), we 
find in our case 

c5 ki „ = V["te* + *fiHceEty\ = VffiHceEtW, (47.2) 

where £ 0 is the energy at t = 0. 

According to (39.11) the velocity of the particle is v = pc 2 /c5 kin . 
For the velocity v x = x we have therefore 

dx p x c 2 c 2 eEt 



Integrating, we find 



x = ~ E VVlHceEtn (47.3) 



The constant of integration we set equal to zero. 
For determining y, we have 

dy p y c 2 p 0 c 2 



from which 

We obtain the equation of the path by expressing t in terms of y from 
(47.4) and substituting in (47.3). This gives 

x = — =-cosh . (47.5) 

eE poc 

Thus in a uniform electric field a charge moves along a catenary 
curve. 

If the velocity of the particle is v «*: c, then we can take p 0 = mv 0 , 
So = mc 2 , and expand (47.5) in series in powers of 1 /c. Then we get, 
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to within terms of higher order, 

eE 

x= y 2 + constant, 

that is, the charge moves along a parabola, a result well known from 
classical mechanics. 

§48. Motion in a constant uniform magnetic field 

We now consider the motion of a charge e in a uniform magnetic 
field H. We choose the direction of the field as the z axis. We rewrite 
the equation of motion 

p = -vXH 
c 

in another form, by substituting for the momentum 

P = "^> 

where & is the energy of the particle, which is constant in the magnetic 
field. The equation of motion then goes over into the form 

l4f = 7 vXH < 48 ' 1 > 

or, expressed in terms of components, 

v x = covy, i) y — —cov x , v 2 = 0, (48.2) 

where we have introduced the notation 

a>= -g-. (48.3) 



We multiply the second equation (48.2) by i, and add it to the first : 

_d_ 
dt 



(vx + ivy) = -ico(v x +iVy), 
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so that 

v x + ivy = ae 

where a is a complex constant. This can be written in the form a = 
— v ot e~'*, where v ot and a are real. Then 

Vx+ivy = vve-'^'+v 

and, separating real and imaginary parts, we find 

v x = v ot cos (cot+x), Vy = — v 0 , sin (cot+x). (48.4) 

The constants v ot and a are determined by the initial conditions; a is 
the initial phase, and from (48.4) it is clear that 

for = V0£+«J). 

that is, v ot is the velocity of the particle in the xy plane, and stays 
constant throughout the motion. 
From (48.4) we find, integrating once more, 

x = x 0 +r sin (cot+x), y = yo+r cos (cot+x), (48.5) 

where 

r = ^ = ^ = c RL (48.6) 
co ecH eH 



(p, is the projection of the momentum on the xy plane). From the 
third equation (48.2), we find v 2 = v 0z and 

z = z 0 +v 02 t. (48.7) 

From (48.5) and (48.7), it is clear that the charge moves in a uniform 
magnetic field along a helix having its axis along the direction of the 
magnetic field and with a radius r given by (48.6). The velocity of the 
particle is constant. In the special case where v 0z = 0, that is, the 
charge has no velocity component along the field, it moves along a 
circle in a plane perpendicular to the field. 

The quantity co, as we see from the formulae, is the angular frequency 
of rotation of the particle in the plane perpendicular to the field. 
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If the velocity of the particle is low, then we can approximately put 
& = mc 2 . Then the frequency co is changed to 

eH 

a> = — . (48.8) 
mc 



PROBLEMS 

Problem 1. Find the adiabatic invariant for the motion of a charge in a uniform 
magnetic field whose magnitude and direction vary slowly with time. 

Solution. Since the motion in the plane perpendicular to the magnetic field is 
periodic, the adiabatic invariant is the integral (cf. §32) 



taken over a complete period of the motion, i.e. over the circumference of a circle 
in the present case (P, is the projection of the generalised momentum on the plane 
perpendicular to H). Substituting P, = p, + (e/c)A, we have 



In the first term we note that p, is constant in magnitude and directed along dl; 
we apply Stokes' theorem to the second term and write curl A = H: 

where r is the radius of the orbit. Substituting the expression (48.6) for r, we find 

/= M. 
2eH ' 

From this we see that, for slow variation of H, the transverse momentum p, varies 
proportionally to \/H. 

Problem 2. Determine the frequencies of vibration of a charged space oscillator, 
placed in a constant, uniform magnetic field ; the eigenfrequency of vibration of the 
oscillator (in the absence of the field) is co 0 . 

Solution. The equations of forced vibration of the oscillator in a magnetic field 
(directed along the z axis) are 

x+mhc — ^-y, y+coh>=-—~x, z+colz = 0. 
mc mc 
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Multiplying the second equation by i and adding to the first, we find 

eH ■ 
mc 

where I = x+iy. From this we find that the frequencies of vibration of the oscillator 
in a plane perpendicular to the field are 



" eH 

~ 2mc 



If the field H is weak, this formula goes over into 

co = Wn + eHllmc. 
The vibration along the direction of the field remains unchanged. 



§49. Motion of a charge in crossed fields 

Finally we consider the motion of a charge in the case where there 
are present both electric and magnetic fields, constant and uniform. 
We limit ourselves to the non-relativistic case, where the velocity of the 
charge « « c, so that its momentum p = my; as we shall see later, it is 
necessary for this that the electric field be small compared with the 
magnetic field. 

We choose the direction of H as the z axis, and the plane passing 
through H and E as the yz plane. Then the equation of motion 

im = eE + — vXH 
c 

can be written in the form 

e e 
mx = — yH, my — eE y — —xH, mz = eE z . (49.1) 

From the third equation we see that the charge moves with uniform 
acceleration in the z direction, that is, 



eE 



(49.2) 
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Multiplying the second equation (49.1) by i and adding to the first, 
we find 

~ (x + iy) + icoix + iy) = i~E y 

(ft) = eH/mc). The integral of this equation is equal to the sum of the 
integral of the same equation without the right-hand term and a 
particular integral of the equation with the right-hand term. The first 
of these is ae~'°", the second is eEJmca = cE y /H. Thus 

x+iy = ae~"°'+cE y /H. 

The constant a is in generaltomplex. Writing it in the form a — be'*, 
with real b and a, we see that, since a is multiplied by e _to ', we can, 
by a suitable choice of the time origin, give the phase a any arbitrary 
value. We choose this so that a is real. Then breaking up x+iy into 
real and imaginary parts, we find 

x — a cos cot +cE y /H, y = —asincot. (49.3) 

At t = 0 the velocity is along the x axis. We see that the components 
of the velocity of the particle are periodic functions of the time ; their 
average values are 

v x = cEyl H, v y = 0. (49.4) 

This average velocity of a charge in crossed electric and magnetic 
fields is often called the electromagnetic drift velocity. Its direction is 
perpendicular to both fields and is independent of the sign of the 
charge. 

All the formulae of this section assume that the velocity of the 
particle is small compared with the velocity of light ; we see that, for 
this to be so, it is necessary in particular that the electric and magnetic 
fields satisfy the condition 

E y /H«l, (49.5) 

while the absolute magnitudes of E y and H can be arbitrary. 
Integrating equation (49.3) again, and choosing the constant of 
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integration so that at / = 0, x = y = 0, we obtain 

a cE ct 

x — — sin a>t+—~t; v = — (cos cor- 1). (49.6) 
co H co 

Considered as parametric equations of a curve, these equations 
define a trochoid. Depending on whether a is larger or smaller in 



v 




v 



Fig. 29 



absolute value than the quantity cE y /H, the projection of the trajectory 
on the xy plane has the forms shown in Figs. 29a and 29b, respectively. 
If a = —cE y /H, then (49.6) becomes 

cE 

y :(pt— sin <ot), 

(49.7) 

y = -^u -cos co/); 



x = 



coH 



coH v 



that is, the projection of the trajectory on the xy plane is a cycloid 
(Fig. 29c). 
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§50. The electromagnetic field tensor 

Formulae (44.3), (44.4), expressing the field intensities in terms of the 
potentials, are written in three-dimensional form and are therefore 
not suitable for getting the law of transformation of these quantities 
under a change of reference frame. 

It is easy to see that the set of components of the two three-dimen- 
sional vectors E and H can be described as the components of an 
antisymmetric four-tensor : 

f » = #-tF <*»> 

(called the electromagnetic field tensor). The meaning of the individual 
components can be seen by substituting the values = (<j>, —A) in 
the definition (50.1). The result can be written in the form of a matrix 
in which the index /j, = 0, 1, 2, 3 labels the rows, and the index v the 
columns : 



F„=\~7 „ n "J\- (50.2) 



0 


E x 


Ey 


E z 


-E x 


0 




Hy 




H z 


0 




-E z 


-H, 


H x 


0 , 



The contravariant components of the same tensor have a sign change 
whenever a spatial index is raised : 

/0 -E x -E y -E z 

\E, -H y H x 0 

We note that the equations of motion of a charge in the field can be 
written, using the tensor F^, in the form 

*g- = Lp*u,. (50.4) 
as c 
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Writing out the expressions on both sides of the equation using the 
three-dimensional notation (40.2), (40.5), and (50.3) (and substituting 
ds = c dfy^l — (©Vc 2 )], we see that for fx = 1, 2, 3 we get the three 
components of the vector equation (44.6), while for fi = 0 we find 
the work equation (44.7). 

The transformation formulae for the fields E and H can now be 
found using the general rules for transformation of four-tensors. The 
components of the four-tensor of rank two transform like a product 
of coordinates x^x". Under the Lorentz transformation (36.3) the 
coordinates x 2 = y and x 3 = z do not change, so the component F 23 
does not change : 

For the same reason, the components F 02 , F 03 and F 12 , F 13 transform 
respectively like the coordinates x° = ct and x 1 = x : 

F"*+(KF™/c) = F'"+(KF"*/c) 

V[l - (*"/«■)] ' V[i-(^/c a )l 

and similarly for F 03 , F 13 . Finally, the component F 01 should transform 
like the product xfx 1 ; one would then get 

1 f \y* r : y l 

poi — : J F'oi i _ p>io i _1_ (f' 01 + F' 10 1 1 

l-(V*l<*) \ c 2 + c K + ; J* 

But since in the present case the tensor F f " is antisymmetric, F' 01 = 
= — F' 10 and so 

F 01 - F' 01 . 

Now expressing the components of the tensor F t ' t in terms of the 
components of the fields E and H according to (50.3), we find the 
following transformation formulae for the electric field : 

E -E> E E 'y + ( VH '^ p E '*-iy H 'yl c ) (505) 
and for the magnetic field : 

_ h;-(ve' 2 /c) _ h' 2 +(ve;/c) 
* " ' ~ V[i -(^/c 2 )] ' * ~ VU -(^/c 2 )] " 
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Thus the electric and magnetic fields, like the majority of physical 
quantities, are relative ; that is, their properties are different in different 
reference frames. In particular, the electric or the magnetic field can be 
equal to zero in one reference frame and at the same time be present 
in another frame. 

If the magnetic field H' = 0 in the frame K', then, on the basis of 
(50.5) and (50.6), the following relation exists between the electric 
and magnetic fields in the frame K: 

H = -VxE. (50.7) 

c 

If, in the frame K', E' = 0, then in the frame K 

E = ---VXH. (50.8) 
c 

Consequently, in both cases, in the frame K the magnetic and electric 
fields are mutually perpendicular. These formulae also have a signific- 
ance when used in the reverse direction: if the fields E and H are 
mutually perpendicular (but not equal in magnitude) in some reference 
frame K, then there exists a reference frame K' in which the field is 
purely electric or purely magnetic. 

§51. Invariants of the field 

From the electric and magnetic field intensities we can form invariant 
quantities, which remain unchanged in the transition from one inertial 
frame of reference to another. 

We obtain such a quantity by forming the four-dimensional scalar 
F^F" 1 . Writing it out in three-dimensional notation, we find that 
F^F"* = 2{H 2 —E?). Thus one of the required invariants is the 
quantity 

IP — E 2 = invariant. (51.1) 

By a direct check using formulae (50.5), (50.6) one easily verifies 
that the sum E x H x +E }l H y +E 2 H 2 is left unchanged by a Lorentz 
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transformation Thus 

EH = invariant. (51.2) 

There is, however, a fundamental difference between these two 
invariants with respect to their behaviour under reflection {inversion) 
of the spatial coordinate system (simultaneous change of sign of 
x, y, z). We recall that under such a transformation the components of 
a true (polar) vector change sign. But the components of a vector 
that can be written as the vector product of two polar vectors remain 
unchanged under inversion (such vectors are said to be axial). The 
scalar product of two axial or two polar vectors is a true scalar — it is 
not changed by inversion. The scalar product of an axial and a polar 
vector is a pseudoscalar — it changes sign under inversion. 

But according to the definition (44.3), (44.4), E is a polar vector 
while H is an axial vector (the vector product of the polar vectors 
V and A). It is then clear that IP—E 2 is a true scalar, while E-H is a 
pseudoscalar (while its square (E-H) 2 will be a true scalar). 

We note some consequences of the invariance of the expressions 
(5 1 . 1), (5 1 .2). If in some frame of reference the fields E and H are equal 
in magnitude (E 2 = H 2 ), then they are equal in magnitude in every 
other inertial reference frame. If the fields E and H are perpendicular 
in some reference frame (E«H = 0), then they are also perpendicular 
in every other frame. 

If E-H = 0, we can find a frame of reference in which either 
E = 0 or H = 0 (depending on whether E 2 -!! 2 < 0 or > 0), i.e. the 
field is purely magnetic or purely electric. Conversely, if in some 
reference frame E = 0 or H = 0, then they will be perpendicular in 
every other frame, in accordance with our remarks at the end of the 
preceding section. 

An exceptional case occurs when both invariants are zero : E«H = 0 
and E 2 —H 2 = 0. In this case E and H are equal in magnitude and 
mutually perpendicular in every frame of reference. 



CHAPTER 11 



THE ELECTROMAGNETIC FIELD 
EQUATIONS 

§52. The first pair of Maxwell's equations 

From the expressions 

H = curl A, E = — — — grad <b 
c at 

it is easy to obtain equations containing only E and H. To do this we 
find curl E: 

1 8 

curl E = x- curl A — curl grad <b. 

c dt 

But the curl of any gradient is zero. Consequently, 

1 flH 

curlE =-71F- (52 - ,} 

Taking the divergence of both sides of the equation curl A = H, and 
recalling that div curl = 0, we find 

div H = 0. (52.2) 

The equations (52.1) and (52.2) are called the first pair of Maxwell's 
equations* We note that these two equations still do not completely 



t Maxwell's equations (the fundamental equations of electrodynamics) were 
first formulated by him in the 1860's. 
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determine the properties of the field. This is clear from the fact that 
they determine the change of the magnetic field with time (the derivative 
8H/8f), but do not determine the derivative 8E/8/. 

Equations (52. 1) and (52.2) can be written in integral form. Accord- 
ing to Gauss' theorem 

JdivHdF= <f H-df, 

where the integral on the right goes over the entire closed surface 
surrounding the volume over which the integral on the left is extended. 
On the basis of (52.2), we have 

H-df=0. (52.3) 



The integral of a vector over a surface is called the flux of the vector 
through the surface. Thus the flux of the magnetic field through every 
closed surface is zero. 
According to Stokes' theorem, 

JcurlE-df = <f E-dl, 

where the integral on the right is taken over the closed contour bound- 
ing the surface over which the integral on the left is taken. From (52.1) 
we find, integrating both sides over any surface, 

1 8 



E-dl =_ - — — 
c dt 



H-df. (52.4) 



The integral of a vector over a closed contour is called the circulation 
of the vector around the contour. The circulation of the electric field 
is also called the electromotive force in the given contour. Thus the 
electromotive force in any contour is equal to minus the time derivative 
of the magnetic flux through a surface bounded by this contour. 



§53. The action function of the electromagnetic field 

The action function S for the whole system, consisting of an electro- 
magnetic field as well as the particles located in it, must consist of three 
parts : 

S=S f +S m + S mf . (53.1) 
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S m is that part of the action which depends only on the properties 
of the particles, that is, just the action for free particles. For a single 
free particle, it is given by (39.1). If there are several particles, then 
their total action is the sum of the actions for each of the individual 
particles. Thus, 

S m = -£mc jds. (53.2) 

The quantity is that part of the action which depends on the 
interaction between the particles and the field. According to §43, we 
have for a system of particles 




(53.3) 



In each term of this sum, is the potential of the field at that point of 
space-time at which the corresponding particle is located. The sum 
S m +S mf is already familiar to us as the action (43.1) for charges in 
a field. 

Finally S f is that part of the action which depends only on the 
properties of the field itself, that is, Sf is the action for a field in the 
absence of charges. Up to now, because we were interested only in the 
motion of charges in a given electromagnetic field, the quantity S f , 
which does not depend on the particles, did not concern us, since this 
term cannot affect the motion of the particles. Nevertheless this term 
is necessary when we want to find equations determining the field 
itself. This corresponds to the fact that from the parts S m +S m f of the 
action we found only two equations for the field, (52.1) and (52.2), 
which are not sufficient for complete determination of the field. 

To establish the form of the action Sf for the field, we start from the 
following very important property of electromagnetic fields. As experi- 
ment shows, the electromagnetic field satisfies the principle of super- 
position. This principle consists in the statement that the field produced 
by a system of charges is the result of a simple composition of the 
fields produced by each of the charges individually. This means that 
the resultant field intensity at each point is equal to the vector sum of 
the individual field intensities at that point. 
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Every solution of the field equations gives a field that can exist in 
nature. According to the principle of superposition, the sum of any 
such fields must be a field that can exist in nature, that is, must satisfy 
the field equations. 

As is well known, linear differential equations have just this prop- 
erty, that the sum of any solutions is also a solution. Consequently 
the field equations must be linear differential equations. 

From the above discussion, it follows that under the integral sign 
in the expression for the action S f there must stand an expression 
quadratic in the field. Only in this case will the field equations be 
linear; the field equations are obtained by varying the action, and in 
the variation the degree of the integrand decreases by unity. 

The potentials cannot enter into the expression for the action S f , 
since they are not uniquely determined (in S m/ this lack of uniqueness 
was not important). Therefore S^must be the integral of some function 
of the electromagnetic field tensor F MV . But the action must be a scalar 
and must therefore be the integral of some (true) scalar. The only such 
quantity is the product F /ir F 1 "'* 

Thus S f must have the form 

S f = aJJ F^dVdt, dF=dxd>>dz, 

where the integral extends over all space and over the time between 
two given moments; a is some constant. Under the integral stands 
FJF" = 2(H* -E 2 ). The field E contains the derivative 8A/8f; but 
it is easy to see that (8A/8*) 2 must appear in the action with the positive 
sign (and therefore E 2 must have a positive sign). For if (8A/8f) 2 
appeared in S^with a minus sign, then sufficiently rapid change of the 
potential with time (in the time interval under consideration) could 
always make S f a negative quantity with arbitrarily large absolute 



t The function in the integrand of S t must not include derivatives of F M „ 
since the Lagrangian can contain, aside from the coordinates, only their first time 
derivatives; the role of "coordinates" (i.e., parameters to be varied in the principle 
of least action) is in this case played by the field potentials A M ; this is analogous to 
the situation in mechanics where the Lagrangian of a mechanical system contains 
only the coordinates of the particles and their first time derivatives. 
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value. Consequently Sf could not have a minimum, as is required by 
the principle of least action. Thus, a must be negative. 

The numerical value of a depends on the choice of units for measure- 
ment of the field. We note that, after the choice of a definite value for a 
and for the units of measurement of the field, the units for measure- 
ment of all other electromagnetic quantities are determined. From 
now on we shall use the Gaussian system of units; in this system a is a 
dimensionless quantity, equal to — (l/l&r). 

Thus the action for the field has the form 

S f = - — !— | F~F" dQ, dQ = c dt dx dy dz. (53.4) 
Kmc J 

In three-dimensional form: 

[S f = ~jj(E*-H*)dVdt. (53.5) 

In other words, the Lagrangian for the field is 



1 



Lf = ^y E2 ~ H2)dv - (53 - 6) 

The action for field plus charges has the form 

Sf= -Sjmcdj-sj-l^dx*- 1 ^J/^*'dfi. (53.7) 

We note that now the charges are no longer assumed to be small, 
as in the derivation of the equation of motion of a charge in a given 
field. Therefore A M and F^ refer to the actual field, that is, the external 
field plus the field produced by the charges themselves ; A M and F Mr now 
depend on the positions and velocities of the charges. 



§54. The current four-vector 

Instead of treating charges as points, for mathematical convenience 
we frequently consider them to be distributed continuously in space. 
Then we can introduce the charge density q such that q dV is the charge 
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contained in the volume dV. The density q is in general a function of 
the coordinates and the time. The integral of q over a certain volume 
is the charge contained in that volume. 

Here we must remember that charges are actually point-like, so that 
the density q is zero everywhere except at points where charges are 
located, and the integral j q dV must be equal to the sum of the charges 
contained in the given volume. Therefore q can be expressed with the 
help of the ^-function* in the following form : 

Q=^e a d(T-t a ) (54.1) 

a 

where the sum goes over all the charges and r fl is the position vector of 
the charge e a . 



t The i-function d(x) is defined as follows: d(x) = 0, for all non-zero values of 
for * = 0, <5(0) = °°, in such a way that the integral 

f «5(*)dx = 1. (I) 

From this definition there result the following properties : if f(x) is any continuous 
function, then 

J f(x)d(x-a)dx=f(a), (II) 

and in particular, 

j f(x)S(x)dx=f(0). (Ill) 

(The limits of integration, of course, need not be ±o=; the range of integration 
can be arbitrary, provided that it includes the point at which the (5-function does not 
vanish.) 

We also have 

*-*) = **), Hax) = ~ 6(x), (IV) 

in the sense that both sides give the same result when they appear as a factor in an 
integrand. 

In the same way as 6(x) was defined for one variable x, we can introduce a 
three-dimensional i-function, 5(r), equal to zero everywhere except at the origin 
of the three-dimensional coordinate system, and whose integral over all space is 
unity. As such a function we can use the product <3(jc) d(y) <5(z). 
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The charge of a particle is, from its very definition, an invariant 
quantity, that is, it does not depend on the choice of reference frame. 
On the other hand, the density q is not an invariant — only the product 
q dV is invariant. 

Multiplying the equality de = odV on both sides by dy, we have 

dedx" = pdVdx" = pdVdt --. 
e e dt 

On the left stands a four-vector (since de is a scalar and dx 11 is a four- 
vector). This means that the right-hand side must be a four-vector. 
But dV dt is a scalar, and so gidx^/dt) is a four-vector. This vector 
(we denote it by /*) is called the current four-vector : 

j" = e%. (54.2) 

The three space components of this vector form a vector in ordinary 
space, 

j = ev, (54.3) 

where v is the velocity of the charge at the given point. The vector j is 
called the current density vector. The time component of the current 
four-vector is eg. Thus 

j" = (cq, j). (54.4) 

Let us introduce the current four-vector into the expression (53.7) 
for the action and transform the second term in that expression. 
Introducing in place of the point charges e a continuous distribution 
of charge with density q, we must write this term as 



qA m dx"dV, 



replacing the sum over the charges by an integral over the whole 
volume. We rewrite it in the form 



f ic' , .„. 1 
— Q-t- A„dVdt = 



A„jx dQ. 
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Thus the total action S takes the form 



-z{-«^4J AfjM dQ - { ^ d£I (54 - 5) 



§55. The equation of continuity 

The change with time of the charge contained in a certain volume 
is determined by the derivative 

8 



8/ 



qdV. 



On the other hand, the change in unit time is determined by the 
quantity of charge which in unit time leaves the volume and goes to 
the outside or, conversely, passes to its interior. The quantity of 
charge which passes in unit time though the element df of the surface 
bounding the volume is equal to pv-df, where v is the velocity of the 
charge at the point in space where the element df is located. The vector 
df is directed, as always, along the external normal to the surface, that 
is, along the normal toward the outside of the volume under considera- 
tion. Therefore g\-df = j -df is positive if charge leaves the volume, 
and negative if charge enters the volume. The total amount of charge 
leaving the given volume per unit time is consequently <j> j -df, where 
the integral extends over the whole of the closed surface bounding the 
volume. 

From the equality of these two expressions, we get 

'&\ edV= ~j j ' df ' (551) 

The minus sign appears on the right, since the left-hand side is positive 
if the total charge in the given volume increases. This equation is the 
equation of continuity, expressing the conservation of charge, in 
integral form. 

We also write this equation in differential form. To do this we 
apply Gauss' theorem to the right-hand side of (55.1): 

^ j-df = JdivjdK, 
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and find 



divj + -||-l dF= 0. 



Since this must hold for integration over an arbitrary volume, the 
integrand must be zero : 

divj+^- = 0. (55.2) 

This is the equation of continuity in differential form. 

It is easy to check that the expression (54. 1) for o in 6-function form 
necessarily satisfies the equation of continuity. For simplicity we 
assume that we have only one charge, so that 

q = ed(r -T 0 ). 

The current j is then 

j = ev<5(r-r 0 ), 

where v is the velocity of the charge. We determine the derivative 3g/8f. 
During the motion of the charge its coordinates change, that is, the 
vector r 0 changes. Therefore 

9g 9g 9ro 
~dt~ ~ ~8ro ' ~8T ' 

But 8r 0 /8f is just the velocity v of the charge. Furthermore, since o is 
a function of r— r 0 , 

9g _ Bq 
8r 0 8r 

Consequently 

-JL = -v -grade = -div(gv) 

(the velocity v of the charge of course does not depend on r). Thus 
we arrive at the equation (55.2). 

In four-dimensional form the continuity equation (55.2) is expressed 
by the statement that the four-divergence of the current four-vector 
is zero : 

S = 0. (55.3) 
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§56. The second pair of Maxwell's equations 

In finding the field equations from the principle of least action, we 
must assume the motion of the charges to be given and vary only the 
potentials of the field, which here play the role of "generalised coordi- 
nates" of the system. (In finding the equations of motion of a particle 
we did the opposite, assuming the field to be given and varying the 
path of the particle.) This derivation is conveniently done in four- 
dimensional form. 

According to the above remarks the variation of the first term in 
(54.5) is now zero, and we must not vary the current j M in the second 
term. Thus, 



c 



dii = 0 



(where we have used the fact that F^bF^ = F^dF 1 " in varying the 
second term). Substituting 



dA, dA„ 



" dx" dx' 
in the factor dF^, we have 

In the second term we interchange the dummy indices fi and v, and 
also replace F"' by —F 1 ". After this the second and third terms are 
identical, so that 



An dx' 

We also write 



and, applying the four-dimensional form of Gauss' theorem (38.9) 
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to the integral of the first term, we find 

1 



4n dx' 



SA.dQ. A 
M 4nc 



F^bA^dS,. (56.1) 



In the last term we must take the values at the limits of integration. 
The limits for the spatial coordinates are at infinity, where the field 
is zero. At the limits of the time integration, that is, at the given 
initial and final time values, the variation of the potentials is zero, 
since in accordance with the principle of least action the potentials 
are given at these times. Thus the last term in (56.1) is zero, and we 
find 

1 1 df"\ 

Since the variations SA U are arbitrary, the quantity in parentheses 
must be zero : 

Let us express these four (/j, = 0, 1,2, 3) equations in three-dimen- 
sional form. For /j, = 1, 

3F" dF 12 dF™ 1 37™ 4n ., 

1 1 1 — ;1. 

dx dy dz c dt c 
Substituting the values of the components of F MV , we find 
dH z dH y 1 dE x _ 4n 



dy dz c dt c 



Jx 



This together with the two succeeding equations = 2, 3) can be 
written as one vector equation : 

-»7^7* < 5 "> 

Finally, the fourth equation (/j, = 0) gives 

div E = 4tiq. (56.4) 
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Equations (56.3) and (56.4) are the second pair of Maxwell's equa- 
tions.* Together with the first pair of Maxwell's equations they com- 
pletely determine the electromagnetic field, and are the fundamental 
equations of the theory of such fields, i.e. of electrodynamics. 

Let us write these equations in integral form. Integrating (56.4) 
over a volume and applying Gauss' theorem 



we get 



JdivEdK = <f E-df, 
E-df = 4n j e dV. 



(56.5) 



Thus the flux of the electric field through a closed surface is equal to 
4ji times the total charge contained in the volume bounded by the 
surface. 

Integrating (56.3) over an open surface and applying Stokes' the- 
orem 

JcurlH-df = j^H-dl, 

we find 



Hdl 



1 1 1'> "'< " 

c at c 



j-df. 



(56.6) 



The quantity 



1 BE 

4n dt 



(56.7) 



is called the displacement current. From (56.6) written in the form 

1 6E N 



t- tin :; K i- 



we see that the circulation of the magnetic field around any contour 
is equal to 4n\c times the sum of the true current and displacement 
current passing through a surface bounded by this contour. 



t Maxwell's equations in a form applicable to point charges in the electro- 
magnetic field in vacuum were formulated by Lorentz. 
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From Maxwell's equations we can obtain the already familiar 
continuity equation. Taking the divergence of both sides of (56.3), we 
find 

div curl H — — -~ div E+ — div i. 
c at c 

But div curl H = 0 and div E = 4ng, according to (56.4). Thus we 
arrive once more at equation (55.2). 



§57. Energy density and energy flux 

Let us multiply both sides of (56.3) by E and both sides of (52.1) 
by H and subtract the resultant equations. Then we get 

1 ^ 8E 1 „ 8H An . ^ _ T , ^ ^ 
_£.— -+ — H--^r- = j-E-(H-curl E-E-curlH). 

c at c at c 

Using the well-known formula of vector analysis, 
div(aXb) — b-curla— a-curlb, 
we rewrite this relation in the form 

Jc ^( £2 + //2 ) = j-E-div (EXH) 



or 



The vector 



S = ,-EXH (57.2) 
An 

is called the Poynting vector. 

We integrate (57.1) over a volume and apply Gauss' theorem to the 
second term on the right. Then we obtain 



8 

a7 



\ E2 ^n H2 dV= -^i- Edv ~j> s - df - (57-3) 
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If the integral extends over all space, then the surface integral 
vanishes (the field is zero at infinity). Furthermore, we can express 
the integral J j-E dV as a sum 2ev-E over all the charges, and sub- 
stitute from (44.7) 



Then (57.3) becomes 

d 
dt 



{j~8^ dF+ £<4 = °- < 57 " 4 > 



Thus for the closed system consisting of the electromagnetic field 
and particles present in it, the quantity in braces in this equation is 
conserved. The second term in this expression is the kinetic energy 
(including the rest energy of all the particles ; see the footnote on p. 1 57) ; 
the first term is consequently the energy of the field itself. We can 
therefore call the quantity 

E 2 +H z 

W = (57.5) 

the energy density of the electromagnetic field; it is the energy per 
unit volume of the field. 

If we integrate over a finite volume, the surface integral in (57.3) 
generally does not vanish, so that we can write the equation in the 
form 

_6 
87 



jj^^dF+X^J = -^S-df, (57.6) 



where now the second term in the braces is summed only over the 
particles present in the volume under consideration. On the left 
stands the change in the total energy of field and particles per unit 
time. Therefore the integral <|>S-df must be interpreted as the flux 
of field energy across the surface bounding the given volume, so that 
the Poynting vector S is this flux density — the amount of field energy 
passing through unit area of the surface in unit time. 
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§58. Momentum density and momentum flux 

In addition to energy, the electromagnetic field also possesses 
momentum, distributed in space with a definite density. The express- 
ion for this density in terms of the field strengths can be found by a 
derivation similar to the one in the preceding section. 

Let us calculate the time derivative of the integral 

f-r— ExHdF. 

J "tax- 
Carrying out the differentiation under the integral sign and replacing 
the derivatives 8E/8* and 8H/3/ by using Maxwell's equations, we get 

= -^J |EXcurlE+HXcurlHjdF--^j*jXHdF. 

In the first integral we transform the integrand by using the formula 
from vector analysis 

V(a-b) = aXcurlb+bXcurla+(a-v)b+(b-v)a, 

according to which we have 

EX curl E = |-v£ 2 -(E.v)E. 

We also write 

(E-v)E = (v-E)E-E(v-E), 

where, in the term (vE)E, it is assumed that the operator v acts on 
both of the factors that follow it. Finally, noting that, according to 
Maxwell's equation (56.4), vE = div E = 4ng, we write 

EXcurlE = fv£ 2 -(v-E)E+4jreE. 

The product H X curl H is transformed similarly, but since div H = 0, 

HXcurlH = ivi/ 2 -(v-H)H. 
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Thus, 



6^ 
8* 



{ d v = -{ I v(£2+ " 2) - (v - E)E - (v - H)H } d v 

J{ 



- | {gE + ~- jXH[ dK. (58.1) 



In the first integral the operator v acts on all terms following it in 
the integrand. According to vector analysis (the general formulation 
of Gauss' theorem), this integral is transformed into a surface integral 
by writing the surface element df in place of the operator dV- v. In the 
second integral, in which the charge density and current appear, we 
change to the discrete form in terms of a sum over the point charges 
located within the given volume. As a result, (58.1) is rewritten in 
the form 

I j dV = - ji {[(^+^)df-E(E.df)-H(H.df)} - 

-X^E+UxHj. (58.2) 

If the integral is extended over all space, the integral over the 
(infinitely distant) surface vanishes. The expression under the sum- 
mation sign in (58.2) is the force acting on the charge. According to 
the equation of motion (44.5) it can be replaced by the derivative dp/df 
of the momentum of the particle. Then (58.2) can be written in the 
form 

It is obviously a statement of the law of conservation of the total 
momentum of the system of particles plus field. The first term in the 
braces is thus the momentum of the electromagnetic field, while the 
integrand in it can be regarded as the momentum density; we denote 
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it by P (<m) : 

po™> = EXH/4jtc = S/c 2 . (58.4) 

We point out that (except for a factor 1 /c 2 ) the momentum density 
coincides with the flux density of the field energy. 

But if the integration on the left-hand side of (58.2) is extended 
over some finite volume of field, the surface integral is not equal to 
zero. We write it in a more compact form, introducing the three- 
dimensional tensor 

o ik = ^ jy (E 2 + H*)b ik - E,E k -//,//* J . (58.5) 
In expanded form its components are 

a xy = — — (E x E y + H x Hy) 

etc. 

The integrand in the surface integral in (58.2) is a vector; using the 
tensor (58.5), we can write its z'th component as a ik df k . Thus the vector 
equation of momentum conservation (58.2), when written in this 
compact form, is 

JLJ pj^dV+^p^ = -j>a ik df k . (58.6) 

We then see that the integral on the right-hand side represents the 
flux of field momentum out of the volume. The product a ik df k is the 
flux of momentum through the surface element df. By definition, the 
vector df is directed along the external normal to the surface. If we 
denote the unit normal vector by N, then df = N df, and 

Oik d/fc = a, k N k df; 
we see that the vector with components a ik N k is the momentum flux 
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density in the direction N, i.e. the flux through unit area perpendicular 
to N. Substituting for a ik from (58.5), we find that this vector is 

i {t (£2+ ^ 2)N-e(n * E) ~ h(n-h) }' (587) 

The tensor a ik is called the Maxwell stress tensor. According to our 
discussion, the component a ik is the flux density of the ith component 
of the momentum in the direction of the x k axis. We note that the 
stress tensor, as we see from (58.5), is symmetric (a ik = a kl ). 
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CONSTANT ELECTROMAGNETIC 
FIELDS 

§59. Coulomb's law 

For a constant electric (electrostatic) field, Maxwell's equations 
have the form 

divE = 4^ e , (59.1) 
curlE = 0. (59.2) 

The electric field E is expressed in terms of the scalar potential alone 
by the relation 

E = -grad<£. (59.3) 

Substituting (59.3) in (59.1), we get the equation which is satisfied 
by the potential of a constant electric field : 

A<£ = -4tiq. (59.4) 

This equation is called Poissorfs equation. In vacuum, i.e., for q = 0, 
the potential satisfies Laplace's equation 

A<t> = 0. (59.5) 

From the last equation it follows, in particular, that the potential 
of the electric field can nowhere have a maximum or a minimum. For 
in order that <j> have an extreme value, it would be necessary that the 
first derivatives of (j> with respect to the coordinates be zero, and that 
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the second derivatives d 2 <j>/dx 2 , d 2 <j>/dy 2 , d 2 <j>/dz 2 all have the same sign. 
The last is impossible, since in that case (59.5) could not be satisfied. 

We now determine the field produced by a point charge. From 
symmetry considerations, it is clear that it is directed along the radius- 
vector from the point at which the charge e is located. From the same 
considerations it is clear that the value E of the field depends only on 
the distance R from the charge. To find this absolute value, we apply 
equation (59.1) in the integral form (56.5). The flux of the electric 
field through a spherical surface of radius R circumscribed around 
the charge e is equal to 4nR 2 E; this flux must equal Ane. From this 
we get 

E = e/R 2 . 

In vector notation, 

E = eR/R 3 . (59.6) 

Thus the field produced by a point charge is inversely proportional to 
the square of the distance from the charge. This is Coulomb's law. The 
potential of this field is, clearly, 

(p = e/R. (59.7) 

If we have a system of charges, then the field produced by this 
system is equal, according to the principle of superposition, to the 
sum of the fields produced by each of the charges individually. In 
particular, the potential of such a field is 

* = ?X* (59 " 8) 

where R a is the distance from the charge e a to the point at which we 
are determining the potential. If we introduce the charge density q, 
this formula takes on the form 




(59.9) 



where R is the distance from the volume element dV to the given field 
point. 
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We note a mathematical relation which is obtained from (59.4) by 
substituting the values of q and <j> for a point charge, i.e. q = ed(R) 
and <f> = e/R. We then find 

A(l//i) = -4jt8(R). (59.10) 



§60. Electrostatic energy of charges 

Let us determine the potential energy of a system of charges. We 
start from the energy of the field, that is, from the expression (57.5) 
for the energy density. The energy of the system of charges must 
be equal to 



E 2 dV, 



where E is the field produced by these charges, and the integral goes 
over all space. Substituting E = — grad <j>, U can be changed to the 
following form : 

U = --~|E-grad</>dK= - ^ J div (E<£) d V+ ^ j <f> div E d V. 

According to Gauss' theorem, the first integral is equal to the integral 
of E</> over the surface bounding the volume of integration, but since 
the integral is taken over all space and since the field is zero at infinity, 
this integral vanishes. Substituting in the second integral div E = 4jtQ, 
we find the following expression for the energy of a system of charges : 

U = ±fQ<f>dV. (60.1) 

For a system of point charges, e a , we can write in place of the integral 
a sum over the charges 

U = fEe^., (60.2) 



where 4> a is the potential of the field produced by all the charges, at 
the point where the charge e a is located. 
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According to formula (59.8), the potentials <j) a are equal to 

where R ab is the distance between the charges e a , e b . For a system of 
point charges, this expression contains an infinite term coming from 
the potential of the self-field of the charge e a (the term of the sum 
with b = a, in which = 0). Accordingly, there is an infinite con- 
stant in the energy (60.2), which is independent of the separation of 
the charges. This part of the energy — the self-energy of the charges — is 
physically meaningless (see below) and should be dropped. There then 
remains only the interaction energy of the charges that depends on 
their separations. It is equal to 

tf' = T2> a &, (60.3) 

a 

where 

<t>' a = £ (60.4) 

b*a R °b 

is the potential at the point of location of e a , produced by all the charges 
other than e a . In other words, we can write 

tf' = jZ^- (60.5) 

2 a*b Rab 

In particular, the energy of interaction of two charges is 

= (606) 

Let us look at the infinite self-energy of an elementary charged 
particle that was mentioned above. It arose as a result of considering 
the particles to be points. But such a treatment is unavoidable in 
classical (non-quantum) relativistic theory because of the fundamental 
principles of the theory of relativity. For, when in classical theory we 
speak of an elementary particle, we mean a particle whose mechanical 
state is completely described by giving its coordinates and its velocity 
of motion as a whole. If such a particle were extended, then it would, 
in any case, have to be regarded as an absolutely rigid body (i.e. a 
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body incapable of deformation), since the very concept of deformation 
is related to the possibility of independent displacements of individual 
parts of the body. But in relativistic mechanics the existence of absolute- 
ly rigid bodies is impossible, as can be seen from the following argu- 
ments. 

Suppose that a rigid body is set in motion by an external force 
acting at one of its points. If the body were absolutely rigid, then all 
of its points would have to start moving simultaneously with the point 
that was subjected to the force ; otherwise the body would deform. But, 
because there is a limiting velocity of propagation of interactions, the 
force is transmitted from the initial point of application to other 
points with a finite velocity, and therefore all the points of the body 
cannot begin to move simultaneously. 

Thus, according to electrodynamics the electron should have an 
infinite self-energy, and consequently an infinite mass. The physical 
meaninglessness of this result shows that electrodynamics as a logically 
closed physical theory becomes self-contradictory when we go to 
sufficiently small distances. We may ask about the order of magnitude 
of these distances. We can answer this question by noting that we 
should obtain a value for the electromagnetic self-energy of the electron 
that is of the order of the rest energy mc 2 . If now we consider the elec- 
tron as having a certain size r e , its self-potential energy would be of 
order e 2 /r e . From the requirement that these quantities be of the 
same order of magnitude, e 2 /r e ~ mc 2 , we find 

r e ~ eVmc 2 . (60.7) 

This dimension (called the "radius" of the electron) determines the 
limit of applicability of electrodynamics to the electron which already 
follows from the fundamental principles of electrodynamics. We must, 
however, keep in mind that actually the limit of applicability of the 
classical electrodynamics which is presented here lies much higher, 
because of quantum phenomena.* 



t Quantum effects begin to be important at distances of order hjmc, where h 
is Planck's constant. 
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§61. The field of a uniformly moving charge 

We determine the field produced by a charge e, moving uniformly 
with velocity V. We call the laboratory system the frame K; the frame 
of reference moving with the charge is the frame K'. Let the charge 
be located at the origin of coordinates of the frame K'. The frame K' 
moves relative to K along the axis X; the axes Y and Z are parallel 
to Y' and Z'. At the time t = 0 the origins of the two systems coincide. 
The coordinates of the charge in the frame K are consequently x = Vt, 
y = z = 0. 

In the K' system the electric field is constant : 

eR' 

and the magnetic field is absent. Transforming to the frame K, using 
(50.5), we have 

£ - — E - 6y ' E eZ ' 



R' 3 ' ' /{'VU -(^/c 2 )] ' R' 3 Vll-(.V 2 /c 2 )] ' 

(61.2) 

We must now express R', x', y', z' in terms of the coordinates x, y, z in 
the frame K. According to the formulae for the Lorentz transfor- 
mation, 

x-Vt 



so that 



where 



R2 = T^h)> (6L3) 



R** = ( x -vtf+[l _(K2/c 2 )](j 2 +z 2 ). (61.4) 
Substituting these expressions in (61.2), we obtain 



where R is the radius vector from the charge e to the field point with 
coordinates x, y, z (its components are x—Vt, y, z). 
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This expression for E can be written in another form by introducing 
the angle 6 between the direction of motion and the radius vector R. 
It is clear that y" + z 2 = R 2 sin 2 6, and therefore 

R* 2 = R 2 [l-(V 2 /c 2 )sin 2 d]. 

Then we have for E 

eR l-V 2 /c 2 

R 3 [\-(y 2 lc 2 )sm 2 e]W y } 

For a fixed distance R from the charge, the value of the field E 
increases as 6 increases from 0 to »/2 (or as 6 decreases from jt to jt/2). 
The field along the direction of motion (6 = 0, jt) has the smallest 
value ; it is equal to 

£,l = (e/R 2 )[l-(V 2 /c 2 )}. 

The largest field is that perpendicular to the velocity (0 = »/2), equal 
to 



R 2 VV-(v 2 /c 2 )]' 

We note that, as the velocity increases, the field decreases, while 
E ± increases. We can describe this pictorially by saying that the electric 
field of a moving charge is "contracted" in the direction of motion. 
For velocities V close to the velocity of light, the denominator in 
formula (61.6) is close to zero in a narrow interval of values 6 around 
the value 6 = n/2. The "width" of this interval is, in order of magnitude, 

A0 ~ VH ~(V 2 /c 2 )]. 

Thus the electric field of a rapidly moving charge at a given distance 
from it is significantly different from zero only in a narrow range of 
angles in the neighbourhood of the equatorial plane, and the width 
of this interval decreases with increasing V as y^l — (V 2 /c 2 )]. 
The magnetic field in the frame K is 

H = -VXE (61.7) 
c 
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[see (50.7)]. In particular, for V <sc c the electric field is given approxi- 
mately by the usual formula for Coulomb's law, E = eR/i? 3 , and the 
magnetic field is 

H = |^- (61-8) 



PROBLEM 

Determine the force (in the frame K) between two charges moving with the 
same velocity V. 

Solution. We shall determine the force F by computing the force acting on one 
of the charges (ej) in the field produced by the other (e 2 ). Using (61.7), we have 

F = ^Ej+^-VXH, = e^l-i^E. + ^i- VCV-Ej). 

Substituting for E 2 from (61.6), we get for the components of the force in the 
direction of motion (F x ) and perpendicular to it (F r ) 

ei e 2 [l-(K'/c 2 )]cos9 ei e t [l-(K 2 /c 2 )] 2 sin 0 



R* [l-(F I /c ^ )sin 2 e] , " ^ • ' [l-(K2/c 11 )sin 2 e] s ' , * 
where R is the radius vector from e 2 to e x , and 6 is the angle between R and V. 



§62. The dipole moment 

We consider the field produced by a system of charges at large 
distances, that is, at distances large compared with the dimensions of 
the system. 

We introduce a coordinate system with origin anywhere within the 
system of charges. Let the position vectors of the various charges be 
r a . The potential of the field produced by all the charges at the point 
having the position vector Ro is 

(the summation goes over all charges); here R 0 — r a are the radius 
vectors from the charges e a to the point where we are finding the 
potential. 
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We must investigate this expression for large Ro (Ro ^ r fl ). To do 
this, we expand it in powers of r a /i? 0 , using the formula 

/(Ro - r) « /(Ro) -r -grad /(R 0 ) 

(in the grad, the differentiation is with respect to the coordinates of 
the vector Ro). To terms of first order, 



4> = -„ Ze fl r a -grad— - 



(62.2) 



The sum 

d = £«A (62.3) 

is called the dipole moment of the system of charges. 

It is important to note that, if the sum of all the charges, Ze a , is 
zero, then the dipole moment does not depend on the choice of the 
origin of coordinates, for the position vectors r a and r' a of one and 
the same charge in two different coordinate systems are related by 

r' a = r fl +a, 

where a is some constant vector. Therefore if Ee a = 0, the dipole 
moment is the same in both systems : 

d' = £ ej'a = £ e fl r fl + a £ e a = d. 

In particular, for a system of two charges with opposite signs (±e) 
the dipole moment d = ex, where r is the radius vector from the charge 
— e to the charge + e. 

If the total charge of the system is zero, then the potential of the 
field of this system at large distances is 

± J 1 d'Ro . . 



The field intensity is 



E = -grad-^ = --^ grad (d-Ro)-(d.Ro) grad 
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or finally, 



E= 3(n^-d j (625) 



where n is a unit vector along R 0 . 

Thus the potential of the field at large distances produced by a 
system of charges with total charge equal to zero is inversely pro- 
portional to the square of the distance, and the field intensity is 
inversely proportional to the cube of the distance. This field has axial 
symmetry around the direction of d. In a plane passing through this 
direction (which we choose as the z axis), the components of the 
vector E are 

, 3 cos 2 0 — 1 „ , 3 sin 6 cos 6 „ 

E ' = d — M — ' — M — ' } 

The radial and tangential components in this plane are 

(62.7) 



§63. The quadrupole moment 

In the expansion of the potential in powers of 1 /R 0 , 

$ = ^(o) + ^(i) +< ^(2) + (63.1) 

the term 0 (n) is proportional to 1/R£ +1 . We saw that the first term, 
0 (O) , is determined by the sum of all the charges; the second term, 
called the dipole potential of the system, is determined by the 
dipole moment of the system. 
The third term in the expansion is 

^ = ^^6^(i)> < 63 - 2) 

where the sum goes over all charges ; we here omit the suffix numbering 
the charges; x, are the components of the vector r, and X t those of 
the vector Ro. This part of the potential is usually called the quadrupole 
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potential. If the sum of the charges and the dipole moment of the 
system are both equal to zero, the expansion begins with $ (2) . 

In the expression (63.2) there enter the six quantities Uex t x k . 
However, it is easy to see that the field depends not on six independent 
quantities, but only on five. This follows from the fact that the function 
l/Ro satisfies Laplace's equation, that is, 

We can therefore write $ (2) in the form 
The tensor 

Dik = Xe(3x,x fc -r 2 i5ffc) (63.3) 

is called the quadrupole moment tensor of the system. From the de- 
finition of D ik it is clear that the sum of its diagonal elements is zero : 

D tl = 0. (63.4) 

Therefore the symmetric tensor D ik has only five independent com- 
ponents. With the aid of D ik we can write 

*"-£wbj-(i)- 

or, performing the differentiation, 

3 2 / 1 \ 3X,X k d ik 
dX,dX k [Ro) iJg R* ' 

and using the fact that d lk D ik = D u = 0, 

Like every symmetric three-dimensional tensor, the tensor D lk can 
be brought to principal axes. Because of (63.4), in general only two 
of the three principal values will be independent. If it happens that 
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the system of charges is symmetric around some axis (the z axis)* 
then this axis must be one of the principal axes of the tensor D ik ; the 
location of the other two axes in the x, y plane is arbitrary, and the 
three principal values are related to one another : 

D xx = D yy = -\D ZZ . (63.7) 

Denoting the component D zz by D (in this case it is simply called 
the quadrupole moment), we get for the potential 

4> w = 4^r(3cos2 0-l), (63.8) 

where 8 is the angle between Ro and the z axis. 

Just as we did for the dipole moment in the preceding section, we 
can easily show that the quadrupole moment of a system does not 
depend on the choice of the coordinate origin, if both the total charge 
and the dipole moment of the system are equal to zero. 

In similar fashion we could also write the succeeding terms of the 
expansion (63.1). The /th term of the expansion defines a tensor 
(which is called the 2'-pole moment tensor) of rank /, symmetric in 
all its indices and vanishing when contracted on any pair of indices ; 
it can be shown that such a tensor has 21+ 1 independent components. 

PROBLEM 

Determine the quadrupole moment of a uniformly charged ellipsoid with respect 
to its centre. 

Solution. Replacing the summation in (63.3) by an integration over the volume 
of the ellipsoid, we have 

A« = g J J* J (2x*-y*-z*) dx dy dz, etc. 

The integration over the volume of the ellipsoid can be reduced to integration 
over the volume of the unit sphere as was done in §25, Problem 2(e). As a result 
we obtain 

i)„ = J«(2 a »-6«-c I ), D„ = ie(26*-fl*- C *), 
D„ = ie(2c ! - fl 2 -6*), 
where e — (47i/3)abcQ is the total charge of the ellipsoid. 



t We are assuming a symmetry axis of any order higher than the second. 
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§64. System of charges in an external field 

We now consider a system of charges located in an external electric 
field. We designate the potential of this external field by <f>(r). The 
potential energy of each of the charges is e a (f>(T a ), and the total potential 
energy of the system is 

U = Y j e a <t>(r a ). (64.1) 

a 

We again introduce a coordinate system with its origin anywhere 
within the system of charges; r fl is the position of the charge e a in 
these coordinates. 

Let us assume that the external field changes slowly over the region 
of the system of charges, i.e. is quasi-uniform with respect to the system. 
Then we can expand the energy U in powers of r a . In this expansion, 

U = t/<°> + t/W + c/(2) + . . . , (64.2) 

the first term is 

£/ (0) = 4>oI> fl , (64.3) 

where <f> 0 is the value of the potential at the origin. In this approxima- 
tion, the energy of the system is the same as it would be if all the charges 
were located at one point. 
The second term in the expansion is 

t/<» = (grad</>VX<? fl r fl . 

Introducing the field intensity E 0 at the origin and the dipole moment 
d of the system, we have 

C/«) = -d-E 0 . (64.4) 

The total force acting on the system in the external quasi-uniform 
field is, to the order we are considering, 

F = EoX<? fl + (vd-E) 0 . 

If the total charge is zero, the first term vanishes, and 

F = (d-V)E, (64.5) 
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i.e. the force is determined by the derivatives of the field intensity 
(taken at the origin). The total moment of the forces acting on the 
system is 

K = £(r a Xe a Eo) = dXEo, (64.6) 

i.e. it is determined by the field intensity itself. 

Let us assume that there are two systems, each having total charge 
zero, and with dipole moments di and d2, respectively. Their mutual 
distance R is assumed to be large in comparison with their internal 
dimensions. Let us determine their potential energy of interaction, U. 
To do this we regard one of the systems as being in the field of the 
other. Then 

V= -dj-Ei, 

where Ei is the field of the first system. Substituting (62.5) for Ei, we 
find 

(d 1 -d 2 )-3(d 1 -n)(d 2 -n) 

U = -jp , (64.7) 

where n is a unit vector from one system to the other. 

For the case where one of the systems has a total charge different 
from zero (and equal to e), we obtain similarly 

U=e^, (64.8) 

where n is the unit vector directed from the dipole to the charge. 
The next term in the expansion (64.2) is 

Here, as in §63, we omit the suffix numbering the charge ; the values 
of the second derivatives of the potential are taken at the origin ; but 
the potential <t> satisfies Laplace's equation, 

3x 2 ,k dXidx k 
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Therefore we can write 



or, finally, 



U"> = ^ J!*L . ,64.9) 
6 ox, ox k 



§65. Constant magnetic field 

Let us consider the magnetic field produced by charges which 
execute a finite motion, in which the particles are always within a 
finite region of space and the momenta also always remain finite. 
Such a motion has a "stationary" character, and it is of interest to 
consider the time average magnetic field H, produced by the charges ; 
this field will now be a function only of the coordinates and not of 
the time, that is, it will be constant. 

In order to find equations for the average magnetic field H, we take 
the time average of Maxwell's equations 

. . n | 1 3E An . 

divH = 0, curlH = ^ — | j. 

c dt c 

The first of these gives simply 

divH = 0. (65.1) 

In the second equation the average value of the derivative dE/dt, like 
the derivative of any quantity which varies over a finite range, is zero t . 



t Suppose that / is such a quantity. Then the average value of the derivative 
df/dt over a time interval 7* is 

$ = 1 f d L a, = /( r >-/(°) 

dt T J dt T 

o 

Since /(/) varies only over a finite range, as T increases without limit the average 
value tends to zero. 
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Therefore the second Maxwell's equation becomes 

Art 

curlR = — j. (65.2) 
c 

These two equations determine the constant field H. 
We introduce the average vector potential A in accordance with 

curl A = H. 

We substitute this in equation (65.2), and find 

— _ 4rr _ 
grad div A — aA = — j. 

c 

But we know that the vector potential of a field is not uniquely defined, 
and we can impose an arbitrary auxiliary condition on it. On this 
basis, we choose the potential A so that 

div A = 0. (65.3) 

Then the equation defining the vector potential of the constant mag- 
netic field becomes 

Arr 

AA j. (65.4) 

c 

It is easy to find the solution of this equation by noting that (65.4) 
is completely analogous to Poisson's equation (59.4) for the scalar 
potential of a constant electric field, where in place of the charge 
density q we here have the current density j/c. By analogy with the 
solution (59.9) of Poisson's equation, we can write 



- L f 1 

cj R 



dV, ( 65 - 5 ) 



where R is the distance from the field point to the volume element dV. 

In formula (65.5) we can go over from the integral to a sum over 
the charges, by substituting in place of j the product q\, and recalling 
that all the charges are point-like. In this we must keep in mind that 
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in the integral (65.5), R is simply an integration variable, and is 
therefore not subject to the averaging process. If we write in place of 
the integral 

JldF thesum Y.^, 

then R a here are the positions of the various particles, which change 
during the motion of the charges. Therefore we must write 

A = -I^L, (65.6) 

where we average the whole expression under the bar. 
Knowing A, we can also find the magnetic field, 

H = curl A = curl— J-dK 
c J R 

The curl operator refers to the coordinates of the field point. Therefore 
the curl can be brought under the integral sign and j can be treated as 
constant in the differentiation. Applying the well-known formula 

curl /a = / curla+grad/Xa, 

where / and a are an arbitrary scalar and vector, to the product j • 1 /R, 
we get 

, j * 1 t jxR 

curl- =grad— Xj =-^-, 

and consequently, 

H = lf-l^-dF (65.7) 
c J R 3 

(the radius vector R is directed from dV to the field point). This is 
Biot and Savarfs law. 
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§66. Magnetic moments 

Let us consider the average magnetic field produced by a system of 
charges in steady motion, at large distances from the system. 

We introduce a coordinate system with its origin anywhere within 
the system of charges, just as we did in §62. Again we denote the 
position vectors of the various charges by r a , and the position vector 
of the point at which we calculate the field by R 0 . Then R 0 — r a is the 
radius vector from the charge e a to the field point. According to (65.6), 
we have for the vector potential 



A=Iv_ 5 f? 1 ^-. (66.1) 
c ^ |R 0 -r a | 

As in §62, we expand this expression in powers of r 0 . To terms of 
first order we have (omitting the suffix a) 



1 I'v-j2>( r - v i„) 



cR 0 



In the first term we can write 



But the average value of the derivative of a quantity varying within 
a finite interval (like Let) is zero. Thus there remains for A the 
expression 

We transform this expression as follows. Noting that v = r, we can 
write (remembering that Ro is a constant vector) 

£e(RoT)v = \ A^er(r-Ro)+IZe[v(r.Ro)-r(v.Ro)]. 
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Upon substitution of this expression in A, the average of the first 
term (containing the time derivative) again goes to zero, and we get 



A = 2c^£ e[v(r ' Ro) ~ r(V ' Ro)] ' 



We introduce the vector 

1 



m=^£fTXv, (66.2) 
which is called the magnetic moment of the system. Then we get for A 



mXRo _ _J__- 



A = - :: V = v-rXi. (66.3) 



Knowing the vector potential, it is easy to find the magnetic field. 
With the aid of the formula 

curl (aXb) = (b-v)a — (a-v)b+a div b — b div a, 

we find 



_ ,/mXRo\ _ ,. Ro ,Ro 

H = curl ( ) = md,v ^r _(m ' v) ^ 

Furthermore, 



div ^j[ = Ro-grad-^p + - / ^divR„ = 0 



and 



._ , Ro 1 1 m 3Ro(E5.R 0 ) 

(m - v) *r = -Ri (•"•v)Ro+Ro(m.v)- ¥ = ^ — ^— 



Thus, 

B 3n(ii 

*o 3 



where n is again the unit vector along R 0 . We see that the magnetic 
field is expressed in terms of the magnetic moment by the same 
formula by which the electric field was expressed in terms of the dipole 
moment [see (62.5)]. 
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If all the charges of the system have the same ratio of charge to 
mass, then we can write 

If the velocities of all the charges v <sc c, then mv is the momentum 
p of the charge and we get 

m =2^ r X P =2^ M ' (66.5) 

where M = Er X p is the mechanical angular momentum of the system . 
Thus in this case, the ratio of magnetic moment to angular momentum 
is constant and equal to ejlmc. 



PROBLEM 

Find the ratio of the magnetic moment to the angular momentum for a system 
of two charges (velocities v <k c). 

Solution. Choosing the origin of coordinates at the centre of mass of the two 
particles we have OT 1 r 1 +m 2 r 2 = 0 and p, = — p 2 = p, where p is the momentum 
of the relative motion. With the aid of these relations, we find 

n * (j^_ + _fO_M_M. 
2c \ m\ m\ I m l +m t 



§67. Larmor precession 

Let us consider a system of charges in an external constant uniform 
magnetic field. 
The time average of the force acting on the system, 

F= X-iO<H = iL £-rXH, 
^ c at ^ c 

is zero, as is the time average of the time derivative of any quantity 
which varies over a finite range. The average value of the moment of 
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the forces is 

K = E|(rX(vXH)) 

and is different from zero. It can be expressed in terms of the magnetic 
moment of the system, by expanding the vector triple product : 

K = E|{v(r.H)-H(vr)} = £| Jv(r-H)-1 H . 

The second term gives zero after averaging, so that 

K = E|^H) = ~Ee{^H)-r(^H)} 

[the last transformation is analogous to the one used in deriving (66.3)], 
or finally 

K = mXH. (67.1) 

We call attention to the analogy with formula (64.6) for the electrical 
case. 

We now consider a system of identical charges executing a finite 
motion (with velocities v <sc c) in the centrally symmetric electric field 
produced by a certain fixed charge (say, the system of electrons of 
an atom in the field of the nucleus). We assume that this system is in 
a weak uniform magnetic field. 

In the absence of the external field the total angular momentum M 
of the system would be a constant. The presence of the weak magnetic 
field results in a slow change of M with time. Let us consider the 
character of this change. To eliminate the effect of the rapidly varying 
intrinsic motion of the charges in the system, we average M over the 
period of this motion. 

According to the familiar equation of mechanics [cf. (27.3)], 

dM/dr = K, 

where K is the torque of the external forces acting on the system 
(averaged over the same time interval as was M). According to (67. 1) 
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and (66.5) we have 



K = mXH = 



e 



MXH. 



2mc 



Therefore 



dM/d? = -SiXM, 



(67.2) 



where 



SI = eHjlmc. 



(67.3) 



An equation of the form of (67.2) means that the vector M (and 
with it the magnetic moment m) rotates with angular velocity — SI 
around the direction of the field, keeping fixed its absolute magnitude 
and the angle it makes with the field direction. This phenomenon is 
called the Larmor precession, and the angular velocity (67.3) the Larmor 
frequency. 

We can now make more precise what we meant above by a suffici- 
ently weak field: the Larmor frequency O must be small compared 
with the frequencies of the intrinsic finite motion of the charges of 
the system. It is obvious that only then is it meaningful to consider 
the change with time of the angular momentum averaged in the 
fashion described above. 



CHAPTER 13 



ELECTROMAGNETIC WAVES 



§68. The wave equation 

The electromagnetic field in vacuum is determined by Maxwell's 
equations, in which we must put q = 0, j = 0. We write them once 
more: 

curlE= divH = 0, (68.1) 

c tit 

curl H = — 4*^ , div E = 0. (68.2) 
c ct 

These equations may possess non-zero solutions. This means that an 
electromagnetic field can exist even in the absence of any charges. 

Electromagnetic fields occurring in vacuum in the absence of 
charges are called electromagnetic waves. We now take up the study 
of the properties of such fields. 

First of all we note that such fields must necessarily be time-varying. 
For, in the contrary case, dHjdt = dEjdt = 0 and the equations 

(68.1) and (68.2) go over into the equations (59.1), (59.2) and (65.1), 

(65.2) of a constant field, in which, however, we now have q = 0, j = 0. 
But the solutions of these equations which are given by formulae 
(59:9) and (65.5) become zero for q = 0, j = 0. 

Let us derive the equations determining the potentials of electro- 
magnetic waves. 

As we already know, because of the ambiguity in the potentials we 
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can always subject them to an auxiliary condition. For this reason, 
we choose the potentials of the electromagnetic waves so that the 
scalar potential is zero : 

<f> = 0. (68.3) 

Then 

1 r)A 

E= , H = curlA. (68.4) 

c of 

Substituting these two expressions in the first equation (68.2), we get 

1 8 2 A 

curlcurlA = — AA+graddivA = w . (68.5) 

cr at* 

Despite the fact that we have already imposed one auxiliary condi- 
tion on the potentials, the potential A is still not completely unique : 
we can add to it the gradient of an arbitrary function which does not 
depend on the time (meanwhile leaving (f> unchanged). In particular, 
we can choose the potential of the electromagnetic wave so that 

divA = 0. (68.6) 
For, substituting E from (68.4) in div E = 0, we have 

dlv -eT = e7 dlvA = 0 ' 

that is, div A is a function only of the coordinates. This function can 
always be made zero by adding to A the gradient of a suitable time- 
independent function. 
The equation (68.5) now becomes 

1 d 2 A 

**~*-w = °- (68 - 7) 

This is the equation which determines the potential of electromagnetic 
waves. It is called d'Alemberfs equation, or the wave equation. 

Applying to (68.7) the operators curl and d/dt, we can verify that 
the electric and magnetic fields E and H satisfy the same wave equa- 
tion. 
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§69. Plane waves 

Let us consider the special case of electromagnetic waves in which 
the field depends only on one coordinate, say x (and on the time). 
Such waves are said to be plane. In this case the equation for the field 
becomes 

where by / is understood any component of the vectors E and H. 
To solve this equation, we rewrite it in the form 

8 3 



\dt dx)\dt dx) J 



and introduce new variables 



x x 

f = ,-_ rj=t + - 

c c 



so that t = |(ij + f), x = \c(r}-£). Then 
6T 



_ Id _3\ _3_ _ J_ /_8_ 8_\ 

~ 2 \ dt C dx)' Br) ~ 2 \ dt +C dx}' 



so that the equation for /becomes 



= 0. 



The solution obviously has the form / = fx{£)+h(r}), where fx and f 2 
are arbitrary functions. Thus 

/ = *('-7) + *(' + t)- (69>2) 
Suppose, for example, f 2 = 0, so that 
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Let us clarify the meaning of this solution. In each plane x = con- 
stant, the field changes with the time; at each given moment, the field 
is different for different x. It is clear that the field has the same values 
for coordinates x and times t which satisfy the relation t—(x/c) = 
= constant, that is, 

x = constant + ct. 

This means that if, at some time t = 0, the field at a certain point x 
in space had some definite value, then after an interval of time t the 
field has that same value at a distance ct along the x axis from the 
original place. We can say that all the values of the electromagnetic 
field are propagated in space along the x axis with a velocity equal 
to the velocity of light, c. 

Thus, fi(t — x/c) represents a plane wave moving in the positive 
direction along the x axis. It is easy to show that f 2 (t + x/c) represents 
a wave moving in the opposite, negative, direction along the x axis. 

In §68 we showed that the potentials of the electromagnetic wave 
can be chosen so that <f> = 0, and div A = 0. We choose in this same 
way the potentials of the plane wave which we are now considering. 
The condition div A = 0 gives in this case 

dAJdx = 0, 

since all quantities are independent of y and z. According to (69.1) 
we then have also d 2 A x /dt 2 = 0, that is, dAJdt = constant. But the 
derivative dA/dt determines the electric field, and we see that the 
non-zero component A x represents in this case the presence of a 
constant longitudinal electric field. Since such a field has no relation 
to the electromagnetic wave, we can put A x = 0. 

Thus the vector potential of the plane wave can always be chosen 
perpendicular to the x axis, i.e. to the direction of propagation of that 
wave. 

Let us consider a plane wave moving in the positive direction of the 
x axis; in this wave, all quantities, and in particular A, are functions 
only of t—(x/c). From the formulae 

E=-i^, H = curlA, 
c at 
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we therefore obtain 

E = --A', H = VXA = ^(t--)xA' = --nXA', (69.3) 

c \ c J c 

where the prime denotes differentiation with respect to t—(xjc) and 
n is a unit vector along the direction of propagation of the wave. 
Substituting the first equation in the second, we obtain 

H = nXE. (69.4) 

We see that the electric and magnetic fields E and H of a plane 
wave are directed perpendicular to the direction of propagation of the 
wave. For this reason, electromagnetic waves are said to be transverse. 
From (69.4) it is clear also that the electric and magnetic fields of 
the plane wave are perpendicular to each other and equal to each other 
in absolute value. 

The energy flux in the plane wave is 

S = - C EXH = — E^n = 4- # 2 n- 
An An An 

Thus the energy flux is directed along the direction of propagation of 
the wave. Since 

W= -L(E* + H*) = ~ 
&n An 

is the energy density of the wave, we can write 

S = cWn, (69.5) 

in accordance with the fact that the field propagates with the velocity 
of light. 

The momentum per unit volume of the electromagnetic field is S/c 2 . 
For a plane wave this gives (Wjc)n. We call attention to the fact that 
the relation between the energy W and the momentum Wjc for the 
electromagnetic wave is the same as for a particle moving with the 
velocity of light [see (39.12)]. 

The flux of momentum of the field is determined by the components 
a ik of the Maxwell stress tensor (58.5). Again choosing the direction 
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of propagation of the wave as the x axis, we find that the only non- 
zero component is 

oxx = W. (69.6) 

As it must be, the flux of momentum is along the direction of prop- 
agation of the wave, and is equal in magnitude to the energy density. 

PROBLEM 

Determine the force exerted on a wall from which an incident plane electro- 
magnetic wave is reflected (with reflection coefficient R). 

Solution. The force f acting on unit area of the wall is given by the flux of 
momentum through this area, i.e., it is the vector with components 

/< = o ik N k +a' ik N k 

where N is the vector normal to the surface of the wall, and a ik and a' u are the 
components of the stress tensors for the incident and reflected waves. Using (69.6), 
we obtain 

f= Wn(N-n)+ 0"n'(N-n'). 

From the definition of the reflection coefficient, we have W = RW. Also introduc- 
ing the angle of incidence 6 (which is equal to the reflection angle) and writing out 
components, we find the normal force (light pressure) 

f s = fV(i+R)cos'8 

and the tangential force 

/, = W(l-R) sin 0 cos 0. 

§70. Monochromatic plane waves 

A very important special case of electromagnetic waves is a wave 
in which the field is a simply periodic function of the time. Such a 
wave is said to be monochromatic. All quantities (potentials, field 
components) in a monochromatic wave depend on the time through a 
factor of the form cos (cot+a). The quantity to is called the angular 
frequency of the wave (we shall simply call it the frequency). 

In a plane wave (propagating along the x axis), the field is a function 
only of t—(x/c). Therefore, if the plane wave is monochromatic, its 
field is a simply periodic function of t—(x/c). The vector potential of 
such a wave is most conveniently written as the real part of a complex 
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expression : 

A = Re{A 0 e- to <'-*W}. (70.1) 

Here A 0 is a certain constant complex vector. Obviously, the fields 
E and H of such a wave have analogous forms with the same frequency 
co. The quantity 

A = 2ac/a> (70.2) 

is called the wavelength ; it is the period of variation of the field with the 
coordinate x at a fixed time /. 
The vector 

k = — n (70.3) 

c 

(where n is a unit vector along the direction of propagation of the 
wave) is called the wave vector. In terms of it we can write (70.1) in the 
form 

A = Re {Aoe**-'-"')}, (70.4) 

which is independent of the choice of coordinate axes. The quantity 
which appears multiplied by i in the exponent is called the phase of 
the wave. 

So long as we perform only linear operations, we can omit the sign 
Re for taking the real part, and operate with complex quantities as 
such.* Thus, substituting 

A = Aoe**-'— "> 



t If two quantities A(f) and B(f) are written in complex form 

A(f) = A 0 e -«"■<, B(f) = B 0 e-*»<, 

then in forming their product we must first, of course, separate out the real part. 
But if, as frequently happens, we are interested only in the time average of this 
product, it can be computed as 

iRe{AB*}, 

since we have 

Re A Re B = i(A 0 e- to ' + A*e to ')-(B 0 e- to ' + B*e to '), 

and when we average, the terms containing factors e ±ii0 " vanish, so that we are 
left with 

Re A • ReB = i(AB* + A*-B) = \ Re (A-B*). 
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in (69.3), we find the relation between the intensities and the vector 
potential of a plane monochromatic wave in the form 

E = ikA, H = ikXA. (70.5) 

We shall now treat in more detail the direction of the field of a 
monochromatic wave. To be specific, we shall talk of the electric field 

E = Re {E 0 e'*"-"")} 

(everything stated below applies equally well, of course, to the magnetic 
field). The quantity E 0 is a certain complex vector. Its square E 2 , is 
(in general) a complex number. If the argument of this number is 
-2a (i.e. E 2 = |E 2 |e~ 2fa ), the vector b defined by 

E 0 = be-*" (70.6) 

will have its square real, b 2 = |E 0 1 2 . With this definition, we write 

E = Re {be*-'-"'—)}. (70.7) 

We write b in the form 

b = bi+ib 2 , 

where bi and b 2 are real vectors. Since b 2 = b 2 — b2+2ibi-b 2 must be 
a real quantity, bi-b 2 = 0, i.e. the vectors bi and b 2 are mutually 
perpendicular. We choose the direction of bi as the y axis (and the x 
axis along the direction of propagation of the wave). We then have 
from (70.7) 

E v = Z>i cos (cot— k-r+a), 1 

' \ (70.8) 

E z = ±02 sin (cot— k-r+a), J 

where we use the plus (minus) sign if b 2 is along the positive (negative) 
z axis. From (70.8) it follows that 

pi pi 

4 + 4 = L (m9) 



Thus we see that, at each point in space, the electric field vector 
rotates in a plane perpendicular to the direction of propagation of the 
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wave, while its endpoint describes the ellipse (70.9). Such a wave is said 
to be elliptically polarised. The rotation occurs in the direction of 
(opposite to) a right-handed screw rotating along the x axis, if we have 
the plus (minus) sign in (70.8). 

If b\ = bi, the ellipse (70.9) reduces to a circle, i.e. the vector E 
rotates while remaining constant in magnitude. In this case we say 
that the wave is circularly polarised. The choice of the directions of the 
y and z axes is now obviously arbitrary. We note that in such a wave 
the ratio of the y and z components of the complex amplitude E 0 is 

EoJEoy = ±i (70.10) 

for rotation in the same (opposite) direction as that of a right-handed 
screw (right and left polarisations)^ 

Finally, if b\ or b 2 equals zero, the field of the wave is everywhere 
and always parallel (or antiparallel) to one and the same direction. 
In this case the wave is said to be linearly polarised, or plane polarised. 
An elliptically polarised wave can clearly be treated as the super- 
position of two linearly polarised waves. 



§71. The Doppler effect 

Now let us turn to the definition of the wave vector and introduce 
the four-dimensional wave vector with components 

k" = (co/c, k). (71.1) 

That these quantities actually form a four-vector is obvious from the 
fact that we get a scalar (the phase of the wave) when we multiply 
by x": 

k u x" = cot-k-r. (71.2) 

From the definitions (70.3) and (71.1) we see that the square of the 
wave four-vector is zero: 

k"^ = 0. (71.3) 
t We assume that the coordinate axes form a right-handed system. 
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Using the law of transformation of the wave four-vector we can 
easily treat the Doppler effect — the change in frequency co of the wave 
emitted by a source moving with respect to the observer, as compared 
to the "true" frequency co 0 of the same source in the reference frame 
(Ko) in which it is at rest. 

Let V be the velocity of the source, i.e. the velocity of the frame K 0 
relative to K. According to the general formula for transformation of 
four-vectors, we have 

WOO = *°-Wc) 

V[i-(F7c 2 )] 

(the velocity of the frame K relative to K 0 is — V). Substituting k° = 
co/c, k 1 = k cos a = (co/c) cos a, where a is the angle (in the frame 
K) between the direction of emission of the wave and the direction of 
motion of the source, and expressing co in terms of co 0 , we obtain 

1 — (F/c)cosa 

This is the required formula. For V <scc, and if the angle a is not too 
close to n/2, it gives 

co fi) 0 |l+^-cosaj . (71.5) 

For a = n/2, we have 

co = cooVtl-C^/c 2 )] ~ a>o[l-(F"/2c*)]; (71.6) 

in this case the relative change in frequency is proportional to the 
square of the ratio V/c. 



§72. Spectral resolution 

Every wave can be subjected to the process of spectral resolution, 
i.e. can be represented as a superposition of monochromatic waves 
with various frequencies. The character of this expansion varies 
according to the character of the time dependence of the field. 

One category consists of those cases where the expansion contains 
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frequencies forming a discrete sequence of values. The simplest case 
of this type arises in the resolution of a purely periodic (though not 
monochromatic) field. This is the usual expansion in Fourier series; 
it contains the frequencies which are integral multiples of the "funda- 
mental" frequency co 0 = 2n/T, where T is the period of the field. We 
write it in the form 

/= £ /.e-*-' (72.1) 

n = — oo 

(where / is any of the quantities describing the field). The quantities 
/„ are defined in terms of the function /by the integrals 

772 

fn = Y l/We^'df. (72.2) 

-172 

Because /(/) must be real, 

/_„ =/;• (72.3) 

In more complicated cases, the expansion may contain integral 
multiples (and sums of integral multiples) of several different in- 
commensurable fundamental frequencies. 

When the sum (72.1) is squared and averaged over the time, the 
products of terms with different frequencies give zero because they 
contain oscillating factors. Only terms of the form /„/_„ = |/J 2 
remain. Thus the average of the square of the field, i.e. the average 
intensity of the wave, is the sum of the intensities of its monochromatic 
components : 

oo oo 

P= E l/J 2 = 2£ l/J 2 (72.4) 

n = — oo n = l 

(where it is assumed that the average of the function / over a period is 
zero, i.e./o = / = 0). 

Another category consists of fields which are expandable in a 
Fourier integral containing a continuous sequence of different fre- 
quencies. For this to be possible, the function f(t) must satisfy certain 
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definite conditions; usually we consider functions which vanish for 
/ — ± °° . Such an expansion has the form 

eo 

M= |/- e -'°"-£. (72-5) 

where the Fourier components are given in terms of the function /(/) 
by the integrals 

/-= J/(0e""'df. (72.6) 

— oo 

Analogously to (72.3), 

/— = /*• (72.7) 

Let us calculate the integral of f 2 over all time. Using (72.5) and 
(72.6), we have 



oo oo 



— oo — oo — oo 



dco 
~2n' 



or, using (72.7), 



f/'d«= f l /. P g = 2f|/„ l ^. (72.8) 

— oo — oo 0 

Thus the integrated intensity can be expressed in terms of the intensities 
of the Fourier components of the wave. 



§73. Partially polarised light 

Every monochromatic wave is, by definition, necessarily polarised. 
However, we usually have to deal with waves which are only approxi- 
mately monochromatic, and which contain frequencies in a small 
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interval Aw. We consider such a wave, and let co be some average 
frequency for it. Then its field (to be specific we shall consider the 
electric field E) at a fixed point in space can be written in the form 

E = E 0 (f)e-''°", 

where the complex amplitude Eo(f) is some slowly varying function of 
the time (for a strictly monochromatic wave Eo would be constant). 
Since Eo determines the polarisation of the wave, this means that at 
each point of the wave its polarisation changes with time; such a wave 
is said to be partially polarised. 

The polarisation properties of electromagnetic waves, and of light 
in particular, are observed experimentally by passing the light to be 
investigated through various bodies* and then observing the intensity 
of the transmitted light. From the mathematical point of view this 
means that we draw conclusions concerning the polarisation properties 
of the light from the values of certain quadratic functions of its field. 
Here of course we mean the time averages of such functions. 

Quadratic functions of the field are made up of terms proportional 
to the products E t E k , E*E\ or E,E k . Products of the form 

E t E k = Evfafi-**, EfE* k = £o% t e^', 

which contain the rapidly oscillating factors e ±2fo ', give zero when the 
time average is taken. The products E t E* k = E m El k do not contain 
such factors, and so their averages are not zero. Thus we see that the 
polarisation properties of the light are completely characterised by the 
tensor 

Jik — Eo/Eok- (73.1) 

Since the vector E 0 always lies in a plane perpendicular to the 
direction of the wave, the tensor J- A has only four components (in 
this section the indices i, k are understood to take on only two values : 
i,k= 1,2, corresponding to the y and z axes; the x axis is along the 
direction of propagation of the wave). 



t For example, through a Nicol prism. 
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The sum of the diagonal elements of the tensor J ik (we denote it by 
J) is a real quantity — the average value of the square modulus of the 
vector E 0 : 

/ = J„ = E^Ej. (73.2) 

This quantity determines the intensity of the wave, as measured by the 
energy flux density. To eliminate this quantity which is not directly 
related to the polarisation properties, we introduce in place of J ik the 
tensor 

Qik = JiklJ, (73.3) 

for which q u = 1 ; we call it the polarisation tensor. 

From the definition (73. 1) we see that the components of the tensor 
J, k , and consequently also Q ik , are related by 

Qik = Q*ki (73.4) 

(i.e. the tensor is Hermitian). Consequently the diagonal components 
q u and q 22 are real (with Q n +Q 22 = 0 while q 21 = q* 2 . Thus the 
polarisation tensor is characterised by three real parameters. 

Let us study the conditions that the tensor Q jk must satisfy for 
completely polarised light. In this case E 0 = constant, and so we have 
simply 

J ik = J Qik = E<xEok (73.5) 

(without averaging), i.e. the components of the tensor can be written 
as products of components of some constant vector. The necessary 
and sufficient condition for this is that the determinant vanish : 

\Qik\ = Q11Q22 — Q12Q21 = 0. (73.6) 

The opposite case is that of unpolarised or natural light. Complete 
absence of polarisation means that all directions (in the yz plane) are 
equivalent. In other words, the polarisation tensor must have the 
form 

Qik = H*. (73.7) 

The determinant is | Q lk | = j. 
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An arbitrary tensor g ik can be resolved into two parts, one symmetric 
(in the indices /, k) and the other antisymmetric. Let us consider the 
special case where the latter part is absent. Because of (73.4) the sym- 
metric tensor g ik is then real (g ik = g* k ). Like every symmetric tensor it 
can be brought to principal axes, with two different eigenvalues, which 
we denote by fa and fa. The directions of the principal axes are 
mutually perpendicular. Denoting the unit vectors along these direc- 
tions by n (1) and n (2) , we can represent g ik in the form 

e» - MM" + fanf>nf\ X, + fa = 1 . (73.8) 

The quantities fa and fa are positive and take on values from 0 to 1. 

Each of the two terms in (73.8) has the form of a product of two 
components of a constant real vector (-\/Ain (1) or v%n (z) ). In other 
words, each of the terms corresponds to linearly polarised light. We 
also see that there is no term in (73.8) containg products of components 
of the two waves. This means that the two parts can be regarded as 
physically independent, or incoherent. For, if two waves are independ- 
ent of one another, the average value of the product E^Efp is equal 
to the product of the average values of the two factors, and since 
each of them is zero, 

Thus in the case considered here the partially polarised wave can be 
represented as a superposition of two incoherent waves (with intensities 
proportional to fa and A 2 ), linearly polarised along mutually perpendic- 
ular directions. (In the general case of a complex tensor g ik , one can 
show that the light can be represented as a superposition of two 
incoherent elliptically polarised waves, whose polarisation ellipses are 
similar and mutually perpendicular.) 

§74. Geometrical optics 

A plane wave is characterised by the property that its direction of 
propagation and amplitude are the same everywhere. Arbitrary electro- 
magnetic waves, of course, do not have this property. 
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Nevertheless, a great many electromagnetic waves, which are not 
plane, have the property that within each small region of space they 
can be considered to be plane. For this, it is clearly necessary that the 
amplitude and direction of the wave remain practically constant 
over distances of the order of the wavelength. 

If this condition is satisfied, we can introduce the wave surfaces, 
i.e. surfaces at all of whose points the phase of the wave is the same 
(at a given time). (The wave surfaces of a plane wave are planes per- 
pendicular to the direction of propagation of the wave.) In each small 
region of space we can speak of a direction of propagation of the wave, 
normal to the wave surface. In this way we can introduce the concept 
of rays — curves whose tangents at each point coincide with the direc- 
tion of propagation of the wave. 

The study of the laws of propagation of waves in this case constitutes 
the domain of geometrical optics. Consequently, geometrical optics 
considers the propagation of electromagnetic waves, in particular of 
light, as the propagation of rays, completely divorced from their wave 
properties. In other words, geometrical optics corresponds to the 
limiting case of small wavelength, A -► 0. 

We now take up the derivation of the fundamental equation of 
geometrical optics — the equation determining the direction of the 
rays. Let /be any quantity describing the field of the wave (any compo- 
nent of E or H). For a plane monochromatic wave, / has the form 



(we omit the Re; it is understood that we take the real part of all 
expressions). 
We write the expression for the field in the form 



If the wave is not plane, but geometrical optics is applicable, the 
amplitude a is, generally speaking, a function of the coordinates and 
time, and the phase ip, which is called the eikonal, does not have a 
simple form, as in (74. 1). It is important to note, however, that y> is a 
large quantity. This is clear immediately from the fact that it changes 




(74.1) 



/= adv. 



(74.2) 
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by 2k when we move through one wavelength, and geometrical optics 
corresponds to the limit X — 0. 

Over small space regions and time intervals the eikonal y> can be 
expanded in series; to terms of first order, we have 

(the origin for coordinates and time has been chosen within the space 
region and time interval under consideration; the derivatives are 
evaluated at the origin). Comparing this expression with (74.1), we 
can write 

k = ^ s gradv», » = (74.3) 

which corresponds to the fact that in each small region of space (and 
each small interval of time) the wave can be considered as plane. 
From the definition of the wave vector, we have k 2 = £0 2 /c 2 . Sub- 
stituting k and co from (74.3), we find 

(W)' = iflSt. (74.4) 



This first-order partial differential equation is called the eikonal equa- 
tion and is the fundamental equation of geometrical optics. 

Equation (74.4) can also be derived by a direct transition to the 
limit X — 0 in the wave equation. The field / satisfies the wave equation 

For a function of the form (74.2) we have 

9 ,2 " 9,2 * + 21 g, 9, & *+V 9,2 [dt) J - 



But the eikonal y> in geometrical optics is a large quantity. We can 
therefore neglect the first three terms compared with the fourth, so 
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that 



Similarly, we find 



8f 2 " 



a/« -(vv)V. 



and substitution in (74.5) gives (74.4). 

From the form of the eikonal equation there results a remarkable 
analogy between geometrical optics and the mechanics of material 
particles. The motion of a material particle is determined by the 
Hamilton- Jacobi equation for the action 5* (§31). This equation, like 
the eikonal equation, is an equation in first partial derivatives. The 
action 5* is related to the momentum p and the Hamiltonian '%> of the 
particle by the relations 

n dS or dS 



Comparing these formulae with the formulae (74.3), we see that the 
wave vector plays the same role in geometrical optics as the momentum 
of the particle in mechanics, while the frequency plays the role of the 
Hamiltonian, i.e., the energy of the particle. The absolute magnitude 
of the wave vector is related to the frequency by the formula k = co/c. 
This relation is analogous to the relation p = S/c between the momen- 
tum and energy of a particle with zero mass, and its velocity which is 
equal to the velocity of light. 
For a particle, we have the Hamilton equations 

833 . 833 

P = -8T' y = r = W 

In view of the analogy we have pointed out, we can immediately write 
the corresponding equations for rays : 



. So) . _ 8a) 
~ ~~87 ' r ~ 8k ' 



(74.6) 
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In vacuum, co = ck, so that k = 0, v = cn (n is a unit vector along the 
direction of propagation); in other words, as it must be, in vacuum 
the rays are straight lines, along which the light travels with velocity c. 1 



From the definition of a monochromatic plane wave, its amplitude 
is the same everywhere and at all times. Such a wave is infinite in 
extent in all directions in space, and exists over the whole range of time 
from — °° to + °° . Any wave whose amplitude is not constant every- 
where at all times can only be more or less monochromatic. We now 
take up the question of the "degree of non-monochromaticity" of a 
wave. 

Let us consider an electromagnetic wave whose amplitude at each 
point is a function of the time. Let co 0 be some average frequency of the 
wave. Then the field of the wave, for example the electric field, at a 
given point has the form E 0 (0e~" Uo '. This field, although it is of course 
not monochromatic, can be expanded in monochromatic waves, that 
is, in a Fourier integral. The amplitude of the component in this 
expansion, with frequency co, is proportional to the integral 



The factor e' (< "~ < " o) ' is a periodic function whose average value is 
zero. If E 0 were exactly constant, then the integral would be exactly 
zero, for co ?£ co 0 . If, however, E 0 (?) is variable, but hardly changes 
over a time interval of order l/\co—co 0 \, then the integral is almost 
equal to zero, the more nearly the slower the variation of E 0 . In order 
for the integral to be significantly different from zero, it is necessary 



t It is true that when applied to the propagation of light in vacuum these equa- 
tions lead to results already known, but the important point is that in their general 
form these results are also applicable to the propagation of light in material media. 
In just this case we find an analogy to the motion of particles in an external force 
field. 
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that E o (0 vary significantly over a time interval of the order of 

l/|£O-£O 0 |. 

We denote by A* the order of magnitude of the time interval during 
which the amplitude of the wave at a given point in space changes 
significantly. From these considerations, it now follows that the fre- 
quencies deviating most from coo, which appear with appreciable 
intensity in the spectral resolution of this wave, are determined by the 
condition 1/|£0— £Oo| ~ At. If we denote by Aco the frequency interval 
(around the average frequency a> 0 ) which enters into the spectral 
resolution of the wave, then we have the relation 

Aco At ~ 1. (75.1) 

We see that a wave is the more monochromatic (i.e. the smaller Aco) 
the larger At, i.e. the slower the variation of the amplitude at a given 
point in space. 

Relations similar to (75.1) are easily derived for the wave vector. 
Let Ax, Ay, Az be the orders of magnitude of distances along the x, y, 
z axes, in which the wave amplitude changes significantly. At a given 
time, the field of the wave as a function of the coordinates has the 
form 

E 0 (r)e* k »- r , 

where ko is some average value of the wave vector. By a completely 
analogous derivation to that for (75.1) we can obtain the interval Ak 
of values contained in the expansion of the wave into a Fourier 
integral : 

Ak x Ax ~ 1, Ak y Ay ~ 1, AA:,Az ~ 1. (75.2) 

Let us consider, in particular, a wave which is radiated during a 
finite time interval. We denote by At the order of magnitude of this 
interval. The amplitude at a given point in space changes significantly 
during the time At in the course of which the wave travels completely 
past the point. Because of the relation (75.1) we can now say that the 
"lack of monochromaticity" of such a wave, Aco, cannot be smaller 
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than 1 1 At (it can of course be larger): 

Aa>;>~. (75.3) 

Similarly, if Ax, Ay, Az are the orders of magnitude of the extension 
of the wave in space, then for the spread in the values of components 
of the wave vector, entering into the resolution of the wave, we 
obtain 

Afc,;>^, Afc,;>^L, Afc,^. (75.4) 

From these formulae it follows that, if we have a beam of light of 
finite width, then the direction of propagation of the light in such a 
beam cannot be strictly constant. Taking the x axis along the (average) 
direction of light in the beam, we obtain 

_J _ A 
k 'Ay ~ Ay ' 



», £ irrr. ~ ~rr. . ( 75 - 5 ) 



where 6 y is the order of magnitude of the deviation of the beam from 
its average direction in the xy plane and A is the wavelength. 

On the other hand, formula (75.5) answers the question of the limit 
of sharpness of optical image formation. A beam of light whose rays, 
according to geometrical optics, would all intersect in a point, actually 
gives an image not in the form of a point but in the form of a spot. 
For the width A of this spot, we obtain, according to (75.5), 

A ~W~T> (75 - 6) 

where 6 is the opening angle of the beam. This formula can be applied 
not only to the image but also to the object. Namely, we can state that 
in observing a beam of light emerging from a luminous point, this 
point cannot be distinguished from a body of dimensions A/0. In this 
way formula (75.6) determines the limiting resolving power of a 
microscope. The minimum value of A, which is reached for 0 ~ 1, is A, 
in complete agreement with the fact that the limit of geometrical optics 
is determined by the wavelength of the light. 
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PROBLEM 

Determine the order of magnitude of the smallest width of a light beam pro- 
duced from a parallel beam at a distance / from a diaphragm. 

Solution. Denoting the size of the aperture in the diaphragm by d, we- have 
from (75.5) that the angle of deflection of the beam (the "diffraction angle") is ~ kid, 
so that the width of the beam is of order d+(k/d)l. The smallest value of this 
quantity is ~ V(M). 

§76. Fresnel diffraction 

The deviations from the laws of geometrical optics due to the finite 
wavelength of light cause the phenomena of diffraction. Diffraction 
phenomena can be observed, for example, if along the path of prop- 
agation of the light there is an obstacle — an opaque body (we call it a 
screen) or if the light passes through holes in opaque screens. If the 
laws of geometrical optics were strictly satisfied, there would be beyond 
the screen regions of shadow sharply delineated from regions where 
light falls. The diffraction has the consequence that, instead of a 
sharp boundary between light and shadow, there is a quite complex 
distribution of the intensity of the light. These diffraction phenomena 
appear the more strongly the smaller the dimensions of the screens 
and the apertures in them, relative to the wavelength. 

The problem of diffraction can be discussed in general form in 
cases of small deviation from geometrical optics, i.e. when firstly, the 
dimensions of all bodies are large compared with the wavelength 
(this requirement applies both to the dimensions of screens and also 
to the distances from them to the points of emission and observation 
of the light) ; and secondly when there are only small deviations of the 
light from the directions of the rays given by geometrical optics. 

To be specific, let us consider a screen with an aperture through 
which the light passes. Figure 30 shows the screen in profile (the vertical 
line) ; the light travels from left to right. We denote by u some one of 
the components of E or H, a function only of the coordinates, i.e. 
without the factor e~'°". Our problem is to determine the field u P 
(and hence the light intensity ~ |w P | 2 ) at any point of observation P 
beyond the screen. For an approximate solution of this problem in 
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cases where the deviations from geometrical optics are small, we may 
assume that at the points of the aperture the field is the same as it 
would have been in the absence of the screen. 

We introduce some surface S which covers the aperture in the 
screen and is bounded by its edges (a profile of such a surface is 
shown in Fig. 30 as a dashed line). We break up this surface into sections 
with area d/, whose dimensions are small compared with the size of 
the aperture, but large compared with the wavelength of the light. 




p 



We can then consider each of these sections through which the light 
passes as if it were itself a source of light waves spreading out on all 
sides from this section. We shall consider the field at the point P to be 
the result of superposition of the fields of waves coming from all the 
sections of the surface S. (This is called Huygens' principle.) 

The field produced at the point P by the section d/ is proportional 
to the value u of the field at the section d/ itself. In addition, it is 
proportional to the projection df„ of the area d/ on the plane perpendic- 
ular to the direction n of the ray coming from the light source. This 
follows from the fact that no matter what position the element d/has, 
the same rays will pass through it provided its projection df n remains 
fixed, and therefore its effect on the field at P will be the same. 

When the wave is propagated from d/ to P, its phase changes by kR 
(where k is the magnitude of the wave vector of the light and R is the 
distance from d/ to P); this leads to a factor exp (ikR). The wave 
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amplitude decreases over this distance by a factor* ]/R. Thus the 
contribution to the field u P from each element of the surface S is 
proportional to df n -ue' kR /R. 

The total value of u P is found by integrating over S: 



this is the mathematical form of Huygens' principle. In the approxi- 
mation we are considering, the field u P depends only on the shape 
of the edge of the aperture, not on that of the screen itself or on the 
material of which it is made. 

If a light source Q and a point of observation P are at finite distances 
from the screen, only a small part of the surface S near its intersection 
with the line QP will contribute significantly to the integral (76.1): 
since the deviations from geometrical optics are small, the intensity of 
the light arriving at P from various points of the surface S decreases 
very rapidly as we move away from this line, which corresponds to 
propagation along a geometrical ray from source to observation 
point. Diffraction phenomena in which only a small portion of the 
wave surface plays a role are called Fresnel diffraction phenomena. 

Let us consider the Fresnel diffraction by the straight edge of a 
screen. We choose the xz plane so that it passes through the point Q 
and the point of observation P, and the y axis on the line of the edge 
of the screen, as in Fig. 31. 

Let the distance from the light source Q to the origin be D q . We 
denote the x coordinate of the point of observation P by D p , and its 
z coordinate, i.e. its distance from the xy plane, by d; the region 
d < 0 below the xy plane is the region which according to geometrical 
optics should be in shadow (region of geometrical shadow). 



t When a wave is propagated from a point source, its intensity decreases as 
1 /R 2 , since the total energy flux is constant and is distributed over a surface which 
increases as R 2 . The intensity, in turn, is proportional to the square of the wave 
amplitude. 




(76.1) 
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X 



Fig. 31 



We now determine the distribution of light intensity on the screen 
near the edge of the geometrical shadow, i.e. for values of d small 
compared with D p and D q . 

As the surface of integration in (76.1) we choose the upper yz half- 
plane. As shown above, only the region near the origin, where y and 
z are small compared with D q and D p , makes an important contribution 
to the integral. For this integration it is sufficient to retain only the 
rapidly varying exponentials in the integrand, regarding the factor 
l/R as constant. 

The field u of the wave leaving Q is proportional to exp (ikR q ) at a 
distance R q . For points on the surface of integration, 



1 

'i 

and the field is 



/ y 2 + Z 2 \ 

u ~ exp (ikR q ) ~ exp lik — j . 

The distance of the point of observation is 

R = V[y 2 Hz-d)*+B$] « D p + ^- [y*+(z-dn 



2D P 



We substitute this in (76.1). We then find 



Up ~ 

o 



j exp {'"(2i7 z2+ W (z ^ )2 )} dz - (762) 
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Here the constant factors independent of d, including the integral 
with respect to y, are omitted. The integration can be extended to 
infinity because the integral in (76.2) is rapidly convergent, even 
though the form of the integrand contains the assumption that 
d « D q , D p . 

The expression (76.2) can also be written in the form 



up ~ exp 



{^2^ + ^)} J 



exp • 



ik 



/ 1 1 \ d "I s 

U+T^y—D;] 



1 1 



• dz. 



The light intensity is determined by the square modulus \ u P \ 2 , which 
does not contain the phase factor preceding the integral. An obvious 
substitution reduces the integral itself to 



Up 



where 



w = d 



j e'i'dr), 

-W 

kD a 



2D p (D q + D p ) 
Thus, the intensity / at the point P is 



(76.3) 
(76.4) 



7 = ^1 fe* 

71 I J 



(76.5) 



it will be shown below that J 0 is equal to the intensity in the illuminated 
region at points far from the edge of the shadow. 

The region of geometrical shadow corresponds to negative w. Let us 
find the form of the function I(w) for large values of |w|, within the 
shadow. Integrating by parts, we have 



oo 

\w\ 



e'"' dt] = 



1 

2*| h> | 



oo 



drj 
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Integrating by parts once more on the right side of the equation, 
we obtain an expression proportional to l/|w| 3 . Thus, retaining only 
the term which decreases least rapidly with increasing |w|, 



oo 

J 



elv ' dv % ~27^[ e ""- (76 - 6) 



Hence, for the intensity I(w), (76.5), we obtain the following asymptotic 
formula, valid for large negative values of w : 

/ = /o/4w ? . (76.7) 

We now consider positive values of w, that is, the illuminated region. 
We write 

oo + oo —w oo 

j* e"»* drj = j* e"»' dr? - j* e'"' dr? = (1 + 1) j/^- - j* e*>' drj. 

— W — oo — oo w 

For sufficiently large w, we can use the approximation (76.6) for the 
integral standing on the right-hand side of the equation, and we then 
easily obtain 



/ = 7 0 (i + rin <*-**n . 



(76.8) 



Thus in the illuminated region, far from the edge of the shadow, the 
intensity has an infinite sequence of maxima and minima, so that the 
ratio 7//o oscillates on both sides of unity. With increasing w, the 
amplitude of these oscillations decreases, and the positions of the 
maxima and minima steadily approach one another. 

For small w, the function I{w) has qualitatively this same character 
(Fig. 32). In the region of the geometric shadow, the intensity decreases 
monotonically as we move away from the boundary of the shadow. 
(On the boundary itself, I/h = \.) For positive w, the intensity has 
alternating maxima and minima. At the first (largest) maximum, 
///„ = 1.37. 
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§77. Fraunhofer diffraction 



Of special interest for physical applications are those diffraction 
phenomena which occur when a plane parallel bundle of rays is 
incident on a screen. As a result of the diffraction, the beam ceases to 
be parallel, and there is light propagation along directions other than 
the initial one. Let us consider the problem of determining the distribu- 
tion over direction of the intensity of the diffracted light at large 



distances beyond the screen (this formulation of the problem corre- 
sponds to Fraunhofer diffraction). Here we shall again restrict ourselves 
to the case of small deviations from geometrical optics, i.e. we shall 
assume that the angles of deviation of the rays from the initial direction 
(the diffraction angles) are small.* 

Let us consider, for example, the Fraunhofer diffraction of a plane 
wave normally incident on an infinite slit (of width 2a) with parallel 
sides cut in an opaque screen. We choose the plane of the slit as the yz 
plane, with the z axis along the slit (Fig. 33 shows a section of the 
screen). By symmetry, the light is deflected only in the xy plane. 

Let us denote by u 0 the field which would exist beyond the screen if 



t This problem can be solved by starting from the general formula (76.1) and 
passing to the limit where the light source and the point of observation are at 
infinite distances from the screen. A characteristic feature of the case we are 
considering is that, in the integral, the whole surface S over which the integral is 
taken is important (in contrast to the case of Fresnel diffraction, where only a 
small part of the surface is important). However, it is simpler to treat this problem 
anew, without recourse to the general formula (76.1), and we shall now do so. 



■0 




w 



Fig. 32 
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geometrical optics were rigorously valid. It is an infinite strip of width 
2a cut from the plane wave. Actually, however, a wave with a limited 
cross-sectional area cannot be strictly plane (§75). In its spatial Fourier 
expansion there appear components with wave vectors having different 
directions, and this is precisely the origin of the diffraction. 

In any plane parallel to that of the slit, u 0 is non-zero and constant 
for — a < y < a. We expand this field as a Fourier integral with 
respect to y: 



Uq — Ho | e-'irdy = — ° sin qa. (77.1) 
*7 



For small deviations from geometrical optics (small q) the compo- 
nents in the expansion of the field u 0 can be assumed to be identical 
with the components of the actual field of the diffracted light, so that 
formula (77. 1) solves our problem. 

The wave vector k of the incident light is along the x axis. The 
wave vector corresponding to the component u q of the diffracted wave 
is k' = k+q, and lies in the xy plane at an angle 6 to the x axis which 
is small and is such that q % k6 = coO/c. The intensity of light diffract- 
ed in the range dq is proportional to \u q \ 2 dq. Taking u q from (77.1), 
we obtain for the angular distribution of the diffracted light 

d/=4 ^de. (77.2) 
nak (r 



This is normalised so that 7 0 is the total intensity, which is the same as 
that of the incident beam.* 

dl/dO as a function of the diffraction angle has the form shown in 
Fig. 34. As 6 increases toward either side from 6 = 0, the intensity 
goes through a series of maxima with rapidly decreasing height. The 



t This is easily shown by using the well-known result 
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successive maxima are separated by minima at the points 6 = nn/ka 
(where n is an integer) ; at the minima, the intensity falls to zero. 

Let us consider the Fraunhofer diffraction from two screens which 
are "complementary", one having holes where the other is opaque: 
for example, an infinite slit and an infinite strip. We denote by 
and uf) the fields which would exist beyond the screens according to 



sin 2 x 




Fig. 33 Fig. 34 



geometrical optics. Since the two sets of holes in the screens together 
form an entire plane, the sum u^+uf^ is just the total incident plane 
wave; this has a quite definite direction of propagation, and so its 
Fourier component u^+u^ = 0 for all q ^ 0. Hence the intensities 
are such that Iw^'l 2 = |«[, 2) | 2 , and therefore complementary screens 
give the same distribution of intensity of the diffracted light (this is 
called Babinefs principle). 



§78. Characteristic vibrations of the field 

We consider a free electromagnetic field (no charges) in some finite 
volume of space. To simplify further calculations we assume that this 
volume has the form of a rectangular parallelepiped with sides A, B, C, 
respectively. Then we can expand all quantities characterising the 
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field in this parallelepiped in a triple Fourier series (for the three co- 
ordinates). This expansion can be written (e.g. for the vector potential) 
in the form 

A = I(ake*-' + «2e-*-'), (78.1) 

k 

explicitly indicating that A is real. The summation extends here over 
all possible values of the vector k whose components run through the 
values 

k x = 2nn x /A, k y = 2nn y \B, k z = 2nn z \C, (78.2) 

where n x , n y , n z are positive and negative integers. From the equation 
div A — 0 it follows that, for each k, 

k-a k = 0, (78.3) 

i.e., the complex vectors a k are perpendicular to the corresponding 
wave vectors k. The vectors a k tire, of course, functions of the time; 
they satisfy the equation 

a k + c 2 A: 2 a k = 0. (78.4) 

If the dimensions A, B, C of the volume are sufficiently large, then 
neighbouring values of k x , k v , k z (for which n x , n y , n z differ by unity) 
are very close to one another. In this case we may speak of the number 
of possible values of k x , k y , k z in the small intervals Ak x , Ak y , Ak z . 

Since to neighbouring values of, say, k x , there correspond values of 
n x differing by unity, the number An x of possible values of k x in the 
interval Ak x is equal simply to the number of values of n x in the corre- 
sponding interval. Thus we obtain 

An x = AAk x j2n, An y = B Ak y j2n, An z = C Ak z \2u. 

The total number An of possible values of the vector k with components 
in the intervals Ak x , Ak y , Ak z is equal to the product An x An y An z , 
that is, 

An = ~^ Ak x Ak y Ak z , (78.5) 
where V = ABC is the volume of the field. 
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It is easy to determine from this the number of possible values of the 
wave vector having absolute values in the interval Ak, and directed 
into the element of solid angle Ao. To get this we need only transform 
to spherical polar coordinates in the "k space" and write in place of 
Ak x Ak y Ak 2 the element of volume in these coordinates. Thus 

An = -^WAkAo. (78.6) 
[2ny 

Finally, the number of possible values of the wave vector with absolute 
value k in the interval Ak and pointing in all directions is (we write 4n 
in place of Ao) 

An= j^WAk. (78.7) 

The vectors a k as functions of the time behave like simply periodic 
functions with periods co k = ck (see (78.4)). We present the expansion 
of the field in such a form that it appears as an expansion in propagat- 
ing plane waves. To do this we assume that each of the a k depends on 
the time through the factor e _to *': 

a k ~ e- to *', co k = ck. (78.8) 

Then each individual term in the sum (78.1) is a function only of the 
difference k«r— a> k t, which corresponds to a wave propagating in the 
direction of the vector k. 
We calculate the total energy 

& = In J( £ *+ // *) dK 

of the field in the volume V, expressing it in terms of the quantities a k . 
For the electric field we have 

E = - I A = --£(*ke*' r +*&!-' k - r ), 

c c V 

or, keeping in mind (78.8), 

E = i £ fc(a k e' k - r - ■Je-*"). (78.9) 

k 
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For the magnetic field H = curl A, we obtain 

H = /£(kXa k e' kr -kXa k e-' kr )- (78.10) 
k 

When calculating the squares of these sums, we must keep in mind that 
all products of terms with wave vectors k ^ k' give zero on integration 
over the whole volume, since such terms contain factors of the form 
e ±iv \ q - k±k', and the integral, e.g. 

A 

J e f<2*/-4)»** dx, 
0 

with integer n x different from zero, gives zero. In those terms from 
which the exponentials drop out, integration over dV gives just the 
volume V. 
As a result, we obtain 

& = ^£{fc 2 a k .aJ + (kXa k ).(kXaH)}. 

k 

Since a k -k = 0, we have 

(kXa k ).(kXaH) = fc 2 a k .a k , 

and we obtain finally 

<5=£<5i«, U = V~a k .a||. (78.11) 

Thus the total energy of the field is expressed as a sum of the energies 
6^ associated with each of the plane waves individually. 

In a completely analogous fashion, we can calculate the total momen- 
tum of the field, 

^J SdK = ij EXHdK ' 
for which we obtain 

0- ( 78 -i2) 

This result could have been anticipated in view of the relation between 
the energy and momentum of a plane wave (see §69). 
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The expansion (78.1) succeeds in expressing the field in terms of a 
series of discrete variables (the vectors a k ), in place of the description 
in terms of a continuous series of variables, which is essentially what is 
done when we give the potential A(x, y, z, t) at all points of space. 
We now make a transformation of the variables a k , which has the 
result that the equations of the field take on a form similar to the 
canonical equations (Hamilton's equations) of mechanics. 

We introduce the real "canonical variables" Q k and P k according 
to the relations 

V 




(a k + a£), 
4^(a k -a k ) = Q k . 



(78.13) 



The Hamiltonian of the field is obtained by substituting these 
expressions in the energy (78.11): 

76 = = Xt(P£ + o>2Q2)- (78.14) 

k k 

Then Hamilton's equations d76jd¥ k = Q k coincide with P k = Q k , 
which is thus a consequence of the equations of motion. (This was 
achieved by an appropriate choice of the coefficient in (78.13).) The 
equations of motion 635/6Q k = — P k become the equations 

Q k +oiQ k = 0, (78.15) 

that is, they are identical with the equations of the field. 

Each of the vectors Q k and P k is perpendicular to the wave vector 
k, i.e. has two independent components. The direction of these vectors 
determines the direction of polarisation of the corresponding travelling 
wave. Denoting the two components of the vector Q k (in the plane 
perpendicular to k) by Q kj , j = 1, 2, we have 

Qi = iei/, 

j 

and similarly for P k . Then 

76 = ^76^, 76 kJ = UHj+oi&i))- (78.16) 
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We see that the Hamiltonian splits into a sum of independent terms 
^5 kj , each of which contains only one pair of the quantities Q kp P kj . 
Each such term corresponds to a travelling wave with a definite wave 
vector and polarisation. The quantity 76 kj has the form of the Hamilto- 
nian of a one-dimensional "oscillator", executing a simple harmonic 
vibration. For this reason, one sometimes refers to this result as the 
expansion of the field in terms of oscillators. 



CHAPTER 14 



RADIATION OF ELECTROMAGNETIC 
WAVES 



§79. The retarded potentials 

Let us derive the equations determining the potentials of the field 
produced by moving charges. To do this we repeat the derivation given 
in §68, but now do not assume that the charge and current densities 
are zero. 

Substituting the definitions 



in the equation 



1 9A 

E=---^--v<£, H = curlA (79.1) 
c of 



, „ 4n . 1 9E 
curlH = — j + — 

c c at 



we get 

, . » . j l 4rc . 1 9 2 A . 

curlcurlA = -AA+graddivA = — j - ^ - grad 

(79.2) 

(where we have interchanged the order of the operations grad and 
9/9/ in the last term). 
As the supplementary condition on the potentials we now choose 

divA+-^- = 0. (79.3) 
c at 
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This condition is called the Lorentz condition, and the potentials 
satisfying it are said to be in the Lorentz gauge? Then the last terms 
on both sides of (79.2) cancel and we arrive at the equation 

AA-^-^ = - — J- (79.4) 



Similarly, substituting (79.1) in the equation div E = 4ng, we get 

— — ,f-divA — Ad) = 4np, 
c at 

or, expressing div A from the condition (79.3), 

A0-^-|^ = -4^. (79.5) 

Equations (79.4), (79.5) are the desired equations. For constan 
fields, these reduce to the already familiar equations (59.4) and (65.4) 
and for variable fields without charges, to the homogeneous wave 
equation. 

As we know, the solution of the inhomogeneous linear equations 
(79.4) and (79.5) can be represented as the sum of the solution of these 
equations without the right-hand side, and a particular integral of 
these equations with the right-hand side. To find the particular solution, 
we divide the whole space into infinitely small regions and determine 
the field produced by a charge located in one of these volume elements. 
Because of the linearity of the field equations, the actual field will be 
the sum of the fields produced by all such elements. 

The charge de in a given volume element is, generally speaking, a 
function of the time. If we choose the origin of coordinates in the 
volume element under consideration, then the charge density is 



t The condition (79.3) is more general than the conditions (j> = 0, div A = 0 
that were used in §68; potentials that satisfy the latter conditions also satisfy 
(79.3). Unlike those conditions, the Lorentz condition has an invariant character: 
potentials that satisfy it in one frame of reference also satisfy it in every other 
frame of reference. This is clear from the fact that the condition (79.3) can be written 
in a four-dimensional invariant form : 



6A"ldxr - 0. 
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q = de(t) <5(R), where R is the distance from the origin. Thus we must 
solve the equation 

^ ~ c l = - 4?r de « W ( 79 - 6 ) 

Everywhere, except at the origin, <5(R) = 0, and we have the equation 

1 d 2 d> 

^-^^ = 0. (79.7) 

It is clear that in the case we are considering 4> has central symmetry, 
i.e., 4> is a function only of R. Therefore if we write the Laplace 
operator in spherical coordinates, (79.7) reduces to 

R? dR\ dR J c 2 dt 2 

To solve this equation, we make the substitution 4> = %(R, t)/R. 
Then, we find for % 

1 = 

dR 2 c 2 dt 2 

But this is the equation of plane waves, whose solution has the form 
X =f 1 [t-(R/c)]+Mt+(R/c)]. 

Since we only want a particular solution of the equation, it is 
sufficient to choose only one of the functions fi and j\. Usually it 
turns out to be convenient to take f 2 = 0 (concerning this, see below). 
Then, everywhere except at the origin, (j> has the form 

4> = X [t ~f IC)] ■ (79.8) 
R 

So far the function % is arbitrary ; we now choose it so that we also 
obtain the correct value for the potential at the origin In other words, 
we must select % so that at the origin equation (79.6) is satisfied. This is 
easily done, noting that as R ■— 0, the potential increases to infinity, 
and therefore its derivatives with respect to the coordinates increase 
more rapidly than its time derivatives. Consequently as R -» 0, we can, 
in equation (79.6), neglect the term (l/c 2 )(8 2 </>/8f 2 ) compared with A(j>. 
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Then it goes over into the familiar equation (59. 10) leading to Coulomb's 
law. Thus, near the origin, (79.8) must go over into Coulomb's law, 
from which it follows that %(t) = de(t), that is, 

de[t-(R/c)] 



R 



From this it is easy to get to the solution of equation (79.5) for an 
arbitrary distribution of charges q(x, y, z, /). To do this, it is sufficient 
to write de = q dV (dV is the volume element) and integrate over the 
whole space. To this solution of the inhomogeneous equation (79.5) 
we can still add the solution <$> 0 of the same equation without the right- 
hand side. Thus, the general solution has the form 



l e [r',f-(*/c)]dr+0o, (79.9) 



<«r, t) = 

R = r-r', dV' = dx'dy'dz' 

where 

r = (x, y, z), r' = (x', y', z'); 

R is the distance from the volume element dV' to the "field point" at 
which we determine the potential. We shall write this expression 
briefly as 



4> = 



--'^dV+fo, (79.10) 



where the subscript means that the quantity q is to be taken at the 
time t—(R/c), and the prime on dV has been omitted. 
Similarly we have for the vector potential 



l_f j*-* 
c] R 



Rlc dF+Ao, (79.11) 



where A 0 is the solution of equation (79.4) without the right-hand term. 

The potentials (79.10) and (79.11) (without (j> 0 andAo) are called the 
retarded potentials. 

If the charges are at rest (i.e. density q independent of the time), 
formula (79. 10) goes over into the known formula (59.9) for the electro- 
static field potential; for the case of stationary motion of the charges, 
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formula (79.11), after averaging, goes over into formula (65.5) for the 
vector potential of a constant magnetic field. 

The quantities A 0 and (j> 0 in (79.10) and (79.1 1) are to be determined 
so that the conditions of the problem are fulfilled. To do this it is 
clearly sufficient to impose initial conditions, that is, to fix the values 
of the field at the initial time. However, we do not usually have to deal 
with such initial conditions. Instead we are usually given conditions 
at large distances from the system of charges throughout all time. Thus, 
we may be told that radiation is incident on the system from outside. 
Corresponding to this, the field which is developed as a result of the 
interaction of this radiation with the system can differ from the external 
field only by the radiation originating from the system. This radiation 
emitted by the system must, at large distances, have the form of waves 
spreading out from the system, that is, in the direction of increasing R. 
But precisely this condition is satisfied by the retarded potentials. Thus 
these solutions represent the field produced by the system, while (j>o 
and A 0 must be put equal to the external field acting on the system. 

§80. The Lienard-Wiechert potentials 

Let us determine the potentials for the field produced by a point 
charge executing an assigned motion along a path r = r 0 (t). 

According to the formulae for the retarded potentials, the field at 
the point of observation P(x, y, z) at time t is determined by the state 
of motion of the charge at the earlier time t', for which the time of 
propagation of the light signal from the point T 0 (t'), where the charge 
was located, to the field point P just coincides with the difference 
t—t'. Let R(t) = r—T 0 (t) be the radius vector from the charge e to the 
point P; like r 0 (t) it is a given function of the time. Then the time t' is 
determined by the equation 

t'+.&- = t. (80.1) 
c 

In the frame of reference in which the particle is at rest at time t', 
the potential at the point of observation at time t is just the Coulomb 
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potential, 



*=fl^y, A = 0. (80.2) 



The expressions for the potentials in an arbitrary reference frame 
can be found directly by finding a four-vector which for v = 0 co- 
incides with the expressions just given for <j) and A. Noting that, accord- 
ing to (80.1), (j) in (80.2) can also be written in the form 

4> = 



c{t-ty 

we find that the required four-vector is 

A ' = e <£n> (80 - 3) 

where u 1 ' is the four-velocity of the charge, R" = [c(t— t'), r — r'], 
where x', y', z', t' are related by the equation (80.1). This equation 
has an invariant character, since it can be written in invariant form : 

R r R" = 0. (80.4) 

Now once more transforming to three-dimensional notation, we obtain, 
for the potentials of the field produced by an arbitrarily moving point 
charge, the following expressions: 

e es 

* = ^r/t> A = c( *-v.R/cr (8a5) 

where R is the radius vector, taken from the point where the charge is 
located to the point of observation P, and all the quantities on the 
right-hand sides of the equations must be evaluated at the time t', 
determined from (80.1). The potentials of the field, in the form (80.5), 
are called the Lienard-Wiechert potentials. 

To calculate the intensities of the electric and magnetic fields from 
the formulae 

E = grade/), H = curl A, 
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we must differentiate </> and A with respect to the coordinates x, y, z 
of the point, and the time t of observation. But the formulae (80.5) 
express the potentials as functions of t', and only through the relation 
(80.1) as implicit functions of x, y, z, t. Therefore to calculate the 
required derivatives we must first calculate the derivatives of t'. 
Differentiating the relation R{t') = c(t — t') with respect to t and r, we get 



dR _ dR dt' _ / 3; 

~df "" W "97 " c[ 



dt) 



dR dR , , dR 
gradtf = -— = -gy-gradf +-^-= -cgradr 



(80.6) 



We find the derivative dR/dt' by differentiating the identity R 2 = R 2 
and substituting dR/dt' = — v(f') (where the minus sign occurs because 
R is the radius vector from the charge e to the point P, while the velocity 
is the time derivative of the coordinates of the charge); then 

dR/dt' = -R\/R. 

The other derivative is 

dR/dr = R/R. 
Substituting these values in (80.6), we find 

lr = TTvW' giadt ' = ~^R^vfc)- (80 - 7) 

With the aid of these formulae, there is no difficulty in carrying out 
the calculation of the fields E and H. Omitting the intermediate 
calculations, we give the final results: 



E = e [i-y/?]/ R v 



(*-R.Y/ c) »r c ^) + ^^ RX {( R "^) x '}' 

(80.8) 

H = RXE. (80.9) 
R 
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Here, v = 8v/9?' ; all quantities on the right-hand sides of the equations 
refer to the time t'. It is interesting to note that the magnetic field 
turns out to be everywhere perpendicular to the electric field. 

The electric field (80.8) consists of two parts of different type. The 
first term depends only on the velocity of the particle (and not on its 
acceleration) and varies at large distances like 1 /R 2 . The second term 
depends on the acceleration, and for large R it varies like 1 /R. Later 
we shall see that this latter term is related to the electromagnetic 
waves radiated by the particle. 

As for the first term, since it is independent of the acceleration it 
must correspond to the field produced by a uniformly moving charge. 
In fact, one can show (we omit the proof) that the field given by this 
term is identical with the field (61.5). 

§81. The field of a system of charges at large distances 

Let us consider the field produced by a system of moving charges at 
distances large compared with the dimensions of the system. 

We choose the origin of coordinates O anywhere in the interior of 
the system of charges. The position vector from O to the point P, 
where we determine the field, we denote by R 0 , and the unit vector 
in this direction by n. Let the position vector of the charge element 
de = q dV be r, and the radius vector from de to the point P be R. 
Obviously R = Ro — r. 

At large distances from the system of charges, R 0 s> r, and we have 
approximately 

R = |R 0 -r| % R 0 -rn. 

We substitute this in formulae (79.10), (79.1 1) for the retarded poten- 
tials. In the denominator of the integrands we can neglect r-n compared 
with R 0 . In t—R/c, however, this is generally not permissible; whether 
it is possible to neglect these terms is determined not by the relative 
values of R 0 /c and r-n/c, but by how much the quantities q and j 
change during the time r -n/c. Since Ro is constant in the integration 
and can be taken out from under the integral sign, we get for the 
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potentials of the field at large distances from the system of charges 
the expressions 

*=-ij e '-«*+'--'' dK ' (8U) 
A = ik I *'-»<+ <°/< dK - (812) 

At sufficiently large distances from the system of charges, the field 
over small regions of space can be considered to be a plane wave. For 
this it is necessary that the distance be large compared not only with 
the dimensions of the system, but also with the wavelength of the 
electromagnetic waves radiated by the system. We refer to this region 
of space as the wave zone of the radiation. 

In a plane wave, the fields E and H are related to each other by 
(69.4), E = HXn. Since H = curl A, it is sufficient for a complete 
determination of the field in the wave zone to calculate only the vector 
potential. In a plane wave we have H = (l/c)AXn [see (69.3)], where 
the dot indicates differentiation with respect to time. Thus, knowing 
A, we find H and E from the formulae* 

H = -Axn, E = — (Axn)Xn. (81.3) 



We note that the field at large distances is inversely proportional 
to the first power of the distance R 0 from the radiating system. We 
also note that the time t enters into the expressions (81.1) - (81.3) 
always in the combination t—Ro/c. 

The radiated electromagnetic waves carry off energy. The energy 
flux is given by the Poynting vector, which, for a plane wave, is 

S — c-r—n. 
An 

The intensity d/ of radiation into the element of solid angle do is 



t The formula E = — (l/c)A [see (69.3)] is here not applicable to the potentials 
<p, A, since they do not satisfy the same auxiliary conditions as were imposed on 



them in §69. 



§82 



Dipole radiation 



257 



defined as the amount of energy passing in unit time through the 
element df = R% do of the spherical surface with centre at the origin 
and radius Ro. This quantity is clearly equal to the energy flux density 
S multiplied by df, i.e. 

dI=c^Rldo. (81.4) 

Since the field H is inversely proportional to R 0 , we see that the amount 
of energy radiated by the system in unit time into the element of solid 
angle do is the same for all distances (if the values of t—R 0 /c are the 
same for them). This is, of course, as it should be, since the energy 
radiated from the system spreads out with velocity c into the surround- 
ing space, not accumulating or disappearing anywhere. 



§82. Dipole radiation 

The time r-n/c in the integrands of the expressions (81.1) and (81 2) 
for the retarded potentials can be neglected in cases where the distri- 
bution of charge changes little during this time. It is easy to find the 
conditions for satisfying this requirement. Let T denote the order of 
magnitude of the time during which the distribution of the charges 
in the system changes significantly. The radiation of the system will 
obviously contain periods of order T (i.e. frequencies of order l/T). 
We further denote by a the order of magnitude of the dimensions of 
the system. Then the time r-n/c ~ ajc. In order that the distribu- 
tion of the charges in the system shall not undergo a significant 
change during this time, it is necessary that ajc <sc T. But cT is just 
the wavelength X of the radiation. Thus the condition a <zcT can 
be written in the form 

a«A, (82.1) 

that is, the dimensions of the system must be small compared with 
the radiated wavelength. 
This condition can be written in still another form by noting that 
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T ~ a/v, so that A ~ ca/v, if v is of the order of magnitude of the 
velocities of the charges. From a <sc A, we then find 

v <k c, (82.2) 

that is, the velocities of the charges must be small compared with the 
velocity of light. 

We shall assume that this condition is fulfilled, and consider the 
radiation at distances from the radiating system large compared with 
the wavelength (and consequently, in any case, large compared with 
the dimensions of the system). As was pointed out in §81, at such 
distances the field can be considered as a plane wave, and therefore 
in determining the field it is sufficient to calculate only the vector 
potential. 

The vector potential (81.2) of the field now has the form 

j,<dF, (82.3) 



cR 0 



where the time t' = t—Ro/c now no longer depends on the variable 
of integration. Substituting j = q\, we rewrite (82.3) in the form 

(the summation goes over all the charges of the system; for brevity, 
we omit the index t' — all quantities on the right-hand side of the 
equation refer to time t'). But 

E CT = -ft E er = * 

where d is the dipole moment of the system. Thus, 

1 



cR 0 



d. (82.4) 



With the aid of formulae (81.3) we find that the magnetic field is 
equal to 

H = ^SXii, (82.5) 
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and the electric field to 

E=^(dXn)Xn. (82.6) 

We note that, in the approximation considered here, the radiation 
is determined by the second derivative of the dipole moment of the 
system. Radiation of this kind is called dipole radiation. 

Since d = Zer, 3 = Zev. Thus the charges can radiate only if they 
move with acceleration. Charges in uniform motion do not radiate. 
This also follows directly from the principle of relativity, since a 
charge in uniform motion can be considered in the inertial frame in 
which it is at rest, and a charge at rest does not radiate. 

Substituting (82.5) in (81.4), we get the intensity of the dipole 
radiation : 

d/ -i (aXn)2d0= 4^ sin2ed °' (82J) 

where d is the angle between 3 and n. This is the amount of energy 
radiated by the system in unit time into the element of solid angle do. 
We note that the angular distribution of the radiation is given by 
the factor sin 2 6. 

Substituting do = 2rt sin 6 dd and integrating over d from 0 to n, 
we find for the total radiation 

1=^- (82.8) 

If we have just one charge moving in the external field, then d = ex 
and 3 = ew, where w is the acceleration of the charge. Thus the total 
radiation of the moving charge is 

/=*g-. (82.9) 

We can perform a spectral resolution of the radiation emitted by 
the system. It is obvious that the emission of a particular mono- 
chromatic component of the radiation is due to the corresponding 
component of the dipole moment of the system, d(f). In treating the 
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problem we must distinguish the cases of expansion in a Fourier 
series or in a Fourier integral. 

If the charges execute a periodic motion (with frequency coo), then 
the dipole moment (and with it the radiation field) must be expanded 
in Fourier series. According to the general formula (72.4) the intensity 
of a monochromatic component (with frequency co = ncoo) is obtained 
from the formula for the average intensity of the radiation, 

7=3^^ (82.10) 

by replacing the mean square A 2 by twice the squared modulus of the 
corresponding Fourier component : 

h = ^\A n \ 2 . 

We can express the Fourier components of the vector A(t) in terms 
of the Fourier components of the vector d(f). To do this we note that 
each term in the expansion of d(f) as a series or as an integral must 
be obtained by a time differentiation of the corresponding term in the 
expansion of d(f), i.e., 

d 2 

'A m e~ ia " = ^ (i a e- im ') = - co 2 d m e- ia ", 



so that 
Therefore 



■co 2 i a . (82.11) 



I n = -^L\& n \ 2 . (82.12) 



We have to work with the Fourier integral expansion in the case 
of radiation accompanying the collision of charged particles (brems- 
strahlung). Here the quantity of interest is the total amount of energy 
radiated during the course of the collision. Let d& m be the energy 
radiated in the form of waves with frequencies in the interval between 
co and co+dco. According to (72.8) we obtain it from the formula for 
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the total energy of the radiation, 



A<5 = 



J'* "3? J* 



(82.13) 



by replacing the integral by the expression 2|dJ 2 da>/27r: 



A5- = ia-l"da» = IdLPda,. 



(82.14) 



We note that a closed system consisting of particles all with the 
same charge-to-mass ratio cannot emit dipole radiation : for such a 
system the dipole moment 



where the constant is the charge-to-mass ratio common to all the 
particles. But Zmt = R Em, where R is the position vector of the 
centre of mass of the system (all velocities v «: c, so that non-relativ- 
istic mechanics is applicable). Thus A is proportional to the acceleration 
of the centre of mass, and is thus zero, since the centre of mass moves 
uniformly. 

If dipole radiation is absent, we must go to higher terms in the 
expansion of the potentials in powers of ajX in order to find the energy 
radiated by the system. In the next approximation (after the dipole 
radiation) we find radiation resulting from the oscillations of both 
the electric quadrupole moment of the system and its magnetic moment. 



Problem 1. Find the radiation from a dipole d, rotating in a plane with constant 
angular velocity CI. 

Solution. Choosing the plane of the rotation as the xy plane, we have 



Since these functions are monochromatic, the radiation is also monochromatic, 
with frequency to = fi. From formula (82.7) we find for the angular distribution 



t In all the problems it is assumed that all particle velocities v •« c. 




wr = constant X £ wr > 



PROBLEMS 1 



d x = d 0 cos Clt, d t = d a sin Clt. 
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of the radiation (averaged over the period of the rotation) 

d7= -^-(1 + cos 2 *) do, 
one" 

where ■& is the angle between the direction n of the radiation and the z axis. The 
total intensity is 



3c 3 



Problem 2. Determine the total radiation in the head-on collision of two particles 
repelling one another. 

Solution. Choosing the coordinate origin at the centre of mass of the particles, 
we find for the dipole moment of the system 

d = efr+ef, = 188 r = ^ r, 
m 1 + m t \rrii m 2 / 

where the subscripts 1 and 2 refer to the two particles, r = Ti— r 2 is the radius 
vector joining them, and fi = m 1 /n x /(/n 1 +'n2) is their reduced mass. The equation 
of the relative motion of the particles is 

e 1 e 2 r 

P = /"V = — 3 - 

^ 0). According to (82.13), the total energy of the bremsstrahlung is 

A,5= 2£_/*L__ e^y J ., d/= l_le^_ 7 d, 

3c 3 Xni! m 2 J J 3c 3 \rrii m 2 / J r 4 

— oo — oo 

For a head-on collision the relative velocity v is determined from 
2 + r ' 2 ' 

where is the velocity at infinity. Substituting At = dr/v in the integral, replacing 
the integral over / by an integral over r from oo to r min = 2e 1 e 2 / 1 uvL and from 

/"min tO oo, 



(1) 



J*" J* 

— oo 

Computing the integral, we find 



dr 



V^oo-^iez/i"/-) 

r min 



45c s e!e 2 V/Hj m 2 / 

Problem 3. Determine the total radiation during the passage of one charge past 
another, if the velocity is so large (though still small compared with c) that the 
deviation from straight-line motion can be assumed to be small. 

Solution. The angle of deflection is small if fiv 2 » e^/g (i.e. the kinetic energy 
(iv* /2 is large compared with the potential energy, of order e x e 2 lo). For rectilinear 
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motion with speed v, r = vTp 2 where q is the impact parameter. Substituting 
in formula (1) of Problem 2 and calculating the integral, we find 

A<5 = rcfeieii) 2 / £i_ _ e 2 \ 2 
SucV \m 2 m 2 / 

Problem 4. Find the formula for the spectral resolution of bremsstrahlung in the 
limit of low frequencies.''' 

Solution. In the integral 

d w = j d(f)e to 'df = p(lL-JL^ j* ve to 'df, 

the acceleration v is significantly different from zero only during a time interval 
~t. Therefore for frequencies to « 1/t we may assume that in the integrand 
cot « 1 and accordingly put e ,a " 1 . Then 

= Fvd,= (- ei -^Up, 

Vmj m 2 / J m 2 / 

— oo 

where Ap is the change in the momentum of the relative motion p = fjy because of 
the collision. According to (82.14) the energy radiated in the frequency range dco is 

dSa, = (~ - (Ap) 2 dco. 

We note that this distribution is independent of frequency, i.e. d^ u) /dco tends to a 
constant limit as to -* 0. 

Problem 5. Determine the intensity of the radiation from a charge moving in a 
circular path in a constant uniform magnetic field. 

Solution. From formula (82.9) we find 

3IWV 



§83. Radiation from a rapidly moving charge 

Now let us consider a charged particle moving in an external field 
with a velocity which is not small compared with the velocity of light. 
To solve this problem, it is convenient to use the Lienard-Wiechert 

t In the spectral resolution of the bremsstrahlung, the main part of the intensity 
is at frequencies cu ~ 1/t, where r is of the order of the duration of the collision. 
Correspondingly, by small frequencies we here mean to •« 1/t. 
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expressions for the fields, (80.8) and (80.9). At large distances we 
must retain only the term of lowest order in 1/R [the second term in 
(80.8)]. Introducing the unit vector n in the direction of the radiation 
(R = nR), we get the formulae 

e nX{(n-v/c)Xw} _ , n 

E ~^R (l-n.v/c)3 ' — nXE, (83.1) 

where all the quantities on the right-hand sides of the equations refer 
to the retarded time t' = t—R/c. 

The intensity radiated into the solid angle do is proportional to E 2 . 
The angular distribution of the radiation is, in general, quite com- 
plicated. In the ultra-relativistic case, 1 —v/c <k 1, it has a characteristic 
property which is related to the presence of high powers of the 
difference 1 — v-n/c in the denominators: the intensity is large within 
the narrow range of angles in which the difference 1— v-n/c is small. 
Denoting by 6 the small angle between n and v, we have 

l-^cos^l-| + ^ % i(l-^ + ^. (83.2) 

This difference is small if 

0~V[l-(*> 2 /c 2 )]. (83.3) 

Thus an ultra-relativistic particle radiates mainly along the direction 
of its own motion, within the small range (83.3) of angles around the 
direction of its velocity. 

The amount of energy radiated during time df into the element of 
solid angle do is 

^EHPdojdt. (83.4) 

In calculating the intensity of the radiation we must now distinguish 
between two ways of defining it. 

In (83.4) 6t is the time interval at the field point, so that the express- 
ion in parentheses is the intensity defined as the energy of the radia- 
tion received by the observer in unit time. But because of the retardation 
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during the propagation of the wave from the radiating particle to the 
field point, the interval df is not the same as the time interval df' 
during which the energy (83.4) was radiated by the moving particle. 
According to (80.7) we have 




(83.5) 



If we define the intensity as the energy radiated by the particle per 
unit time, it is therefore equal to 

d/=£*(l~^-)*do. < 816 > 

For v <k c (as was assumed in§82) the factor 1 — n-y/c can be replaced 
by unity, so that the two definitions of the intensity coincide. 



PROBLEM 

Determine the intensity of the radiation from an ultra-relativistic particle, 
moving in a circular path in a constant uniform magnetic field. 

Solution. For the case where the acceleration and the velocity are at right angles, 
the calculation from (83.1) and (83.6) gives 



, e 2 n> 2 f 1 (1 — v z /c 2 ) sin 2 0 cos 2 <£ \ 

~~ 4nc^ \[~n=0Vc) cos Op [l-OVc) cos df J ' 



where 6 is the angle between n and v, and <p is the azimuthal angle of the vector n 
with respect to the plane passing through v and w. In the ultra-relativistic case the 
small-angle region is the most important. In the region of small 6, 

2eV= f 1 4(l-t> 2 /c 2 )fl 2 \ 

n<? tll-OrVc^ + flT [l-(«*/c*) + 0 2 ] 5COS a °' 

where the element of solid angle do = sin 0 dd d<p «s ddO d<p. Because of the rapid 
convergence of the integral over 0, in calculating the total intensity we can extend 
this integral from 0 to oo. The result is 

3mV[l-(w ! /c ! )] ' 

In this formula we have inserted the expression for the acceleration during circular 
motion in a magnetic field H: 

evH //, i> 2 \ eff 



w = 

mc 
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§84. Radiation damping 

The radiation of electromagnetic waves by moving charges results 
in a loss of energy by them. The reaction of this loss on the motion 
of the charges can be described by introducing corresponding "fric- 
tional forces" f into the equations of motion. 

Let us consider a system of charges executing a stationary motion 
with non-relativistic velocities (v <k c). The average loss of energy 
from the system (per unit time) is equal to the average intensity of 
the radiation (82.10). We choose the forces f so that this energy loss 
can be represented as the average work of these forces. The work of the 
force f per unit time is equal to the product f -v, where v is the velocity 
of the particle. Thus we must have 

Ef^ = -^ (84.1) 

(where the sum goes over all the particles of the system). 
It is easy to see that this requirement is satisfied by the forces 

t.=£d. (84.2) 



For we have 

Z ^ = i v. = i = i -sr <*•*> - 3?*- 



On averaging, the first term, which contains a total time derivative, 
vanishes (cf. the footnote on p. 205), and we arrive at (84.1). The 
forces (84.2) are called radiation damping or Lorentz frictional forces. 

Radiation damping also occurs for the motion of a single particle 
in an external field. In this case d = ev, and the equation of motion, 
including the forces (84.2), has the form 

e 2e 2 
my = eE+--vXH+-^3 v. (84.3) 



Nevertheless, we must keep in mind that the description of the 
action of the charge "on itself" with the aid of the damping force is 
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somewhat unsatisfactory, and contains contradictions. For, in the 
absence of an external field, equation (84.3) reduces to 

. 2e 2 . 

This equation has, in addition to the trivial solution v = constant, 
another solution in which the acceleration v is proportional to exp 
(3mc 3 t/2e 2 ), that is, increases indefinitely with the time. This means, 
for example, that a charge passing through any field, upon emergence 
from the field, would have to be infinitely "self-accelerated". The 
absurdity of this result is evidence for the limited applicability of 
formula (84.3). 

One can raise the question of how electrodynamics, which satisfies 
the law of conservation of energy, can lead to the absurd result that 
a free charge increases its energy without limit. Actually the root of 
this difficulty lies in the earlier remarks (§60) concerning the infinite 
electromagnetic "intrinsic mass" of elementary particles. When in the 
equation of motion we write a finite mass for the charge, then in 
doing this we essentially assign to it formally an infinite negative 
"intrinsic mass" of non-electromagnetic origin, which together with 
the electromagnetic mass should result in a finite mass for the particle. 
Since, however, the subtraction of one infinity from another is not 
an entirely correct mathematical operation, this leads to a series of 
further difficulties, among which is the one mentioned here. 

Since the damping force thus leads to contradictory results, express- 
ion (84.2) is valid only if this force is small compared with the force 
exerted on the charge by the external field. 

§85. Scattering by free charges 

If an electromagnetic wave is incident on a system of charges, then 
under its action the charges are set in motion. This motion in turn 
produces radiation in all directions ; there occurs, we say, a scattering 
of the original wave. 

The scattering is conveniently characterised by the ratio of the 
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amount of energy emitted by the scattering system in a given direction 
per unit time, to the energy flux density of the incident radiation. This 
ratio has the dimensions of area, and is called the scattering cross- 
section (cf. §15). 

Let dl be the energy radiated by the system into solid angle do per 
unit time for an incident wave with Poynting vector S. Then the cross- 
section for scattering (into the solid angle do) is 



(the bar over a symbol means a time average). The integral a of do- 
over all directions is the total scattering cross-section. 

Let us consider the scattering produced by a free charge at rest. 
Suppose there is incident on this charge a plane monochromatic 
linearly polarised wave. Its electric field can be written in the form 



We shall assume that the velocity acquired by the charge under the 
influence of the incident wave is small compared with the velocity of 
light (which is almost always the case). Then we can consider the force 
acting on the charge to be eE, while the force (e/c)vXH due to the 
magnetic field can be neglected. In this case we can also neglect the 
effect of the displacement of the charge during its oscillations under 
the influence of the field. If the charge executes oscillations about the 
origin, then we can assume that the field which acts on the charge is 
at all times the same as that at the origin, that is, 



do- = dl/S 



(85.1) 



E = E 0 cos (cot— k-r+a). 



E = Eo cos (cot+a). 



Since the equation of motion of the charge is 



mi — eE 



and its dipole moment d = er, then 

d = e 2 E//M. 



(85.2) 



To calculate the scattered radiation, we use formula (82.7) for 
dipole radiation (this is justified, since the velocity acquired by the 
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charge is assumed to be small). We also note that the frequency of the 
wave radiated by the charge (i.e. scattered by it) is clearly the same as 
the frequency of the incident wave. 
Substituting (85.2) in (82.7), we find 

d/ = 1^5( E >< D ) 2do - 
4jrm 2 c 3 

On the other hand, the Poynting vector of the incident wave is 

S = c£»/4w. (85.3) 

From this we find, for the cross-section for scattering into the solid 
angle do, 

da = (e z /mc*f sin 2 6 do, (85.4) 

where 6 is the angle between the direction of scattering (the vector n), 
and the direction of the electric field E of the incident wave. We see 
that the scattering cross-section of a free charge is independent of 
frequency. 

To determine the total cross-section a, we choose the polar axis 
along E. Then do = sin 6 dd d0; substituting this and integrating 
with respect to 6 from 0 to n, and over 0 from 0 to 2jt, we find 

— r£)" 

(This is Thomson's formula.) 

PROBLEMS 

Problem 1 . Find the cross section da for the scattering of an unpolarised wave 
(natural light). 

Solution. We must average (85.4) over all directions of the vector E in the plane 
perpendicular to the direction of propagation of the incident wave (the direction 
of the wave vector k). We choose the z axis along the direction of k, and the x axis 
along E. Then the cosine of the angle d between the directions of n and E, i.e. the 
projection of the unit vector n on the x axis, is cos 0 = sin & cos cp, where & and rp 
are the polar angle and azimuth of the direction n. Averaging over all directions 
of E in the plane perpendicular to k is equivalent to averaging over the azimuth 4>. 
We find 

sE?0 = 1 - i sin 2 0 = |(1 + cos 2 0), 
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and, substituting in (85.4), we obtain 

do = KeV>nc 2 ) 2 (l + cos 2 0) do. 
Problem 2. Find the frequency a>' of light scattered by a moving charge. 

Solution. In the frame of reference where the particle is at rest (the rest frame of 
the particle), the frequency of the light will not be changed on scattering: o/ = co. 
This relation can be written in invariant form : 

where kf and k'f are the wave four-vectors of the incident and scattered light, and 
uP is the four-velocity of the particle (the only non-zero component in the rest frame 
is u° = 1). Writing out this equality in an arbitrary frame of reference (in which 
the particle moves with velocity v), we find 



-~cos 0'j = co^l — — cos oj, 



where 0 and 0' are the angles made by the directions of the incident and scattered 
waves with the direction of v. 

Problem 3. Find the cross-section for scattering of a linearly polarised wave by 
a space oscillator — a charge executing small oscillations (under the influence of 
some elastic force) with frequency co„. Include the effects of radiation damping. 

Solution. We write the equation of motion of the oscillator in the incident 
wave in the form 

e 2e* - 

r+cogr = — •E^-i«»+-—— r. 
m imc 3 

In the damping force (the second term on the right) we can substitute approximately 
r = -co§r; then we find 

r+yr+cu;jr = (elm) E 0 e-'°", y = (2e 2 /3mc 3 )co§. 
From this we obtain for forced oscillations 

e e-'«" 

r ^0 — i 2 — = • 

The further calculation is done as in the text (the computation of the average of the 
square of a complex quantity should be done as in the footnote on p. 219). The 
result for the scattering cross-section is 



(mc 2 ) 



(cog-aj^+coV' 



§86. Scattering by a system of charges 

The scattering of electromagnetic waves by a system of charges 
differs from the scattering by a single charge (at rest), first of all in 
the fact that, because of the presence of internal motion of the charges 
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in the system, the frequency of the scattered radiation can be different 
from the frequency of the incident wave. Namely, in the spectral 
resolution of the scattered wave there appear, in addition to the fre- 
quency co of the incident wave, frequencies co' differing from co by 
one of the eigenfrequencies of motion of the scattering system. The 
scattering with changed frequency is called incoherent (or Raman) 
scattering, in contrast to the coherent scattering without change in 
frequency. 

Assuming that the field of the incident wave is weak, we can repres- 
ent the current density in the form j = jo+j', where j 0 is the current 
density in the abse'nce of the external field, and j' is the change in the 
current under the action of the incident wave. Correspondingly, the 
vector potential (and other quantities) of the field of the system also 
has the form A = A 0 +A', where A 0 and A' are determined by the 
currents jo and j'. A' describes the wave scattered by the system, and 
is given in terms of j' by formula (81.2): 



A' = 4- 

cR 0 



j;-^ +I ,, c dF. (86.1) 



Let us consider the two limiting cases where the frequency co of the 
scattered waves is small or large compared with the fundamental 
modes of the system. The latter are of order co 0 ~ v/a, where v is the 
velocity of the charges in the system and a its size. We shall also assume 
that the velocities v <sc c. 

We start with the case where 

co <sc coo ~ v/a. (86.2) 

The scattering can contain both coherent and incoherent parts, but we 
shall here consider only the coherent scattering. 

Under the condition (86.2) we can make the same approximations 
in formula (86.1) as we did in §82. In other words, the scattered 
radiation will be dipole radiation. If this is so, then the intensity of 
its spectral component with frequency co will be proportional to the 
square of the Fourier component | | 2 = co 4 1 d^, | 2 , where d' is the change 
in the dipole moment under the influence of the incident wave. 
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If the total charge of the system is zero (a neutral atom or molecule), 
then for co — 0, d^, approaches a constant limit (if the sum of the 
charges were different from zero, then for co = 0, i.e. for a constant 
field, the system would begin to move as a whole). Therefore for low 
frequencies co we can consider as independent of frequency. Then 
the intensity of the scattered wave, and, with it, the cross-section for 
scattering are proportional to the fourth power of the frequency of the 
incident radiation : 

0coh = constantXco 1 . (86.3) 
We proceed now to the opposite case of high frequencies : 

co » co 0 ~ v/a. (86.4) 

From this condition, the periods of the motion of the charges in 
the system are large compared with the period of the wave. Therefore 
during a time interval of the order of the period of the wave, the 
motion of the charges in the system can be considered uniform. This 
means that, in considering the scattering of short waves, we need not 
take into account the interaction of the charges in the system with 
each other, that is, we can consider them as free. 

Thus in calculating the velocity v' acquired by a charge in the field 
of the incident wave, we can consider each of the charges in the system 
separately, and write for it an equation of motion of the form 

m dv'/d/ = eE = eE 0 e -''<""- k ' r \ 

where k = (co/c)n is the wave vector of the incident wave. The position 
of the charge is, of course, a function of the time. In the exponent 
on the right-hand side of this equation the time rate of change of the 
first term is large compared with that of the second (the first is co, while 
the second is of order kv ~ v(co[c) <scco). Therefore, in integrating the 
equations of motion, we can consider the r on the right-hand side as 
constant. Then 

v' = — E 0 e-'(«"- k - r >. (86.5) 

icom 

For the vector potential of the scattered wave (at large distances from 
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the system), we have, changing (86.1) to a sum, 

where n' is a unit vector in the direction of scattering. Substituting 
(86.5), we find 

A' = e-^'-^Eo V — e-"', (86.6) 

icRtfa m 

where q = k'— k is the difference between the wave vector k = (co/c)a 
of the incident wave, and the wave vector k' = (a>/c)a' of the scattered 
wave. f The value of the sum in (86.6) must be taken at the time /' = 
= t—Ro/c (for brevity as usual, we omit the index /' on r); the change 
of r in the time t-n'/c can be neglected in view of our assumption 
that the velocities of the particles are small. The absolute value of 
the vector q is 

q = 2(«/c) sin (86.7) 

where # is the scattering angle. 

For scattering by an atom (or molecule), we can neglect the terms 
in the sum in (86.6) which come from the nuclei, because their masses 
are large compared with the electron mass. Below we shall be looking 
at just this case, so that we take the factor e 2 /m outside the sum- 
mation sign, and understand by e and m the charge and mass of the 
electron. 

For the field H' of the scattered wave we find from (81.3): 

H' = - Eo * B e-'<"('-*o"> — V e-"'. (86.8) 
The energy flux into an element of solid angle in the direction n' is 

• ^- /} o do -8^^ Xn ')^ e - ,,,r | 2do - 



t Strictly speaking, the wave vector k' = co'n'/c, where the frequency to' of the 
scattered wave may differ from co. However, in the present case of high frequencies 
the difference co'—co ~ a>„ can be neglected. 
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Dividing this by the mean energy flux (c/8ji) |E 0 1 2 of the incident wave 
(cf. the footnote on p. 219) and introducing the angle 6 between the 
direction of the field E of the incident wave and the direction of 
scattering, we finally obtain the scattering cross-section in the form 



The bar means a time average, i.e. an average over the motion of the 
charges in the system; it appears because the scattering is observed 
over a time interval large compared with the periods of motion of the 
charges in the system. 

For the wavelength of the incident radiation, there follows from the 
condition (86.4) the inequality X <k (c/v)-a. As for the relative values 
of X and a, both the limiting cases X :» a and X <sc a are possible. In 
both these cases the general formula (86.9) simplifies considerably. 

In the case of X :» a, in the expression (86.9) q-r <sc 1, since q ~ l/X, 
and r is of the order of a. Replacing e _,q r by unity in accordance 
with this, we have 



that is, the scattering is proportional to the square of the atomic 
number Z. 

We now go over to the case of X <sc a. In the square of the sum 
which appears in (86.9), in addition to the square modulus of each 
term, which is unity, there appear products of the form e' q (r > -r!) . 
In averaging over the motion of the charges, i.e. over their mutual 
separations, n— r 2 takes on values in an interval of order a. Since 
q ~ l/X, X <sca, the exponential factor e' fl ' (r,_r,) is a rapidly oscillating 
function in this interval, and its average value vanishes. Thus, for 
X <sc a, the scattering cross-section is 




(86.9) 




(86.10) 




>2 \ 2 



(86.11) 
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that is, the scattering is proportional to the first power of the atomic 
number. 

The cross-sections (86.9)-(86. 1 1) contain both coherent and incoher- 
ent parts. To determine the coherent scattering cross-section, we 
must separate the part of the field of the scattered wave that has 
frequency a>. The expression (86.8) for the field depends on the time 
through the factor e~'°" and also through the time dependence of the 
sum 27e _ ' q r . This latter dependence also has the consequence that the 
field of the scattered wave contains not only the frequency a> but other 
(slightly different) frequencies. The part of the field that has frequency 
co (i.e. depends on the time only through the factor e~'"") is obtained 
by taking the time average of 27e~' q r . The corresponding expression 
for the coherent scattering cross-section da mh differs from the total 
cross-section da; instead of the average value of the squared modulus 
of the sum it contains the squared modulus of the average value of the 
sum: 

da cob = (^j |F(q)| 2 sin 2 6 do, (86.12) 

where 

F(q) = 27e r ^ r . (86.13) 

The function F(q) is called the atomic form factor. It is worth 
noting that it is just the spatial Fourier component of the average 
charge distribution in the atom, q(i) : 

eF(q) = j e(r) r" 'dK (86. 14) 

This is easily understood if from the start we write the unaveraged 
density n(r) as a sum of delta functions (cf. (54.1)). 
If X » a, we can again replace e~' qr by unity, so that 

dffcoh = ( z ^ 2 sin2 9 do - ( 86 - 15 ) 

Comparing this with the total cross-section (86. 10), we see that d<x coh = 
= da, that is, all the scattering is coherent. 
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If A <k a, then when we average in (86.13) all the terms of the sum 
(being mean values of rapidly oscillating functions of the time) vanish, 
so that dcr coh = 0. Thus in this case the scattering is completely incoher- 
ent. 
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of rigid body 81 



L system 142 
Laboratory system 40, 142 
Lagrange's equations 6 
Lagrangian 5-7 

in central field 31 

in electromagnetic field 153,177 

of free particle 10-12, 137 

for one-dimensional motion 28 

of particle in field 15 

of rigid body 81 

of system of particles 12-16 

for two particles 30 
Laplace's equation 191 
Larmor 

frequency 212 

precession 212 
Least action, principle of 4-6, 103n., 
136 

Length, proper 126 
Lienard-Wiechert potentials 253 
Light 

aberration of 129 

cone 121 

pressure of 218 

velocity of 113, 141n. 
Linear oscillations 74 
Lorentz 

condition 249 

contraction 127 

force 156 

frictional force 266 

gauge 249 

transformation 126 



Magnetic field 1 56ff. 
constant 205-7 

constant uniform 163-6, 210-12 
Magnetic moment 209 
Mass 

additivity of 22 

centre of 22 

of particle 1 1 

reduced 30 
Maxwell stress tensor 190 
Maxwell's equations 173,184 
Metric tensor 133 

Mixed components of four-tensor 1 32 
Molecules, vibration of 62-3 
Moment 

dipole 199 

of force 91 

of inertia 82-7 

magnetic 209 

of momentum, see Angular momen- 
tum 

quadrupole 201-2 

Momentum 
angular, see Angular momentum 
conservation of 20, 22, 27 
density in electromagnetic field 188 
in electromagnetic field 153 
flux in electromagnetic field 189 
four-dimensional 140-1 
moment of, see Angular momentum 
of a particle 20, 137 
of a system 20-3 

Monochromatic electromagnetic waves 
218-21 

Motion, equations of, see Equations of 
motion 

Motion, integrals of, see Integrals of 
the motion 

Newtonian mechanics 9 
Newton's 

equations 13 

third law 21 
Non-inertial frame of reference 97-1 02 
Non-linear oscillations 74-6 
Normal 

coordinates 61-4 
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Normal (cont.) 

oscillations 61-4 
Notation xiii, lln., 154n. 
Null four- vector 131 

One-dimensional 

motion 28-30 

oscillations 50-8, 71-4 

oscillator 51n., Ill 
Optics, geometrical 228-34 
Oscillations 

anharmonic 74-6 

linear 74 

non-linear 74-6 

small, see Small oscillations 
Oscillator 

field 247 

one-dimensional 5 In., Ill 
space 35, 64, 165, 270 

Parametric resonance 71-4 
Particle 3 
Past, absolute 121 
Pendulum 15-16, 63^1, 74, 85 

Foucault's 101-2 
Phase 

of small oscillations 51 

path 110 

of wave 219 
Plane electromagnetic waves 215-21 
Poisson's equation 191 
Polarisation of electromagnetic waves 

221, 224-7 
Position vector 3 
Potential 

dipole 200 

energy 12-15, 19, 33, 100, 200 

four-dimensional 152 

Lienard-Wiechert 253 

retarded 251 

scalar 153 

vector 153 

well 29 
Poynting vector 185 
Precession 

Larmor 212 

regular 89 



Pressure of light 218 

Principle, Hamilton's, see Principle of 

least action 
Principle of least action 4-6, 103n., 

136 
Proper 

length 126 

time 122-3 

volume 127 
Pseudo-Euclidean geometry 117n. 

Quadrupole 
moment 201-2 
potential 200 

Radiation 248-76 

damping 266 
Radius vector, see Position vector 
Reactions 93 
Reduced mass 30 
Relativistic mechanics 9n., 113 
Relativity principle 8 

Einstein's 113 

Galileo's 9 
Remote, absolutely 121 
Resolution, spectral 222^1, 259-61 
Resonance 54-5 

parametric 71-4 
Rest, system at 22 
Retarded potentials 251 
Reversibility of motion 13 
Rigid body 77, 195 

angular momentum of 87-90 

equation of motion of 90-92 

motion of 77-102 
Rolling of bodies in contact 93-6 
Rotator 83, 88 
Rough surfaces 93 
Rutherford's formula 48 

Scalar potential 153, 158-60 

Scattering 
cross-section 46-9, 268 
of particles 44-9, 144-8 
of waves by charges 267-76 

Sectorial velocity 32 
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Self-energy 194-5 
Signal 112 

Sliding of bodies in contact 93 
Small oscillations 50-76 

amplitude of 51, 52 

angular frequency of 51 

damped 64-70 

forced 53-8, 68-70 

free 50-3, 58-64 

frequency of 51 

normal 61-4 

one-dimensional 50-8 

phase of 51 
Smooth surfaces 93 
Space 

components of four- vector 131 

homogeneity of 8, 19 

isotropy of 8, 23 

oscillator 35, 64, 165, 270 
Space-like 

four- vector 131 

interval 119 
Spectral resolution 222-4, 259-61 
Spherical top 83, 88 
Stopping points 29 
Summation convention 81n., 130 
Superposition principle 175 
Symmetric four-tensor 132 
Symmetrical top 82, 89-90 



Tensor 

electromagnetic field 169-71 

four-dimensional 132-5 

inertia 82-4 

metric 133 

polarisation 226 
Thomson's formula 269 
Time 

absoluteness of 9, 1 14 
component of four-vector 131 
homogeneity of 8, 1 8 
proper 122-3 
reversal 13, 157-8 
Time-like 

four-vector 131 
interval 119 



Top 

asymmetrical 82 
spherical 83, 88 
symmetrical 82, 89-90 
Torque 91 

Trace of four-tensor 133 
Translational velocity 78 
Transverse waves 217 
Turning points 29, 33 
Two-body problem 30 



Ultra-relativistic case 139 
Uniform field 15,161-8 



Variation 

of function 5 

of integral 5 
Vector 

four-dimensional, see Four-vector 
potential 153, 158-60 
Velocity 3 
angular 78-80 
composition of 9, 114, 128 
four-dimensional 140 
generalised 3 
of light 113, 141n. 
of propagation of interactions 1 12- 
15 

sectorial 32 
transformation of 127-9 
translational 78 
Volume, proper 127 



Wave 

equation 214 

surface 228 

vector 219 

zone 256 
Wavelength 219 
Waves, electromagnetic 213-76 

plane 215-21 
World 

line 116 

point 116 



